d 


4 


Psychological Review 


* 


" 


i RicHap L. SoLomon, Editor 


dh, University of Pennsylvania E E 
"ec x» 
3 
às o 
, RA 
‘, 
4 VOLUME 69, 1962 
E! 
- t- 
ai : E 
" a p ^ 
HELEN Orr ELIZABETH S. REED Mary Susan GILLETT 
Managing Editor Advertising Manager Editorial Assistant 
Fs ^ T 


- ar NE. BIMONTHLY BY THE 


/.— AMERICAN PSYCHOLOGICAL ASSOCIATION, INC. 
|. PRINCE ax» LEMON STS., LANCASTER, PA. 7 E 
AND d SIXTEENTH su N. W., WASHINGTON 6, D. C. 


- 73 h d 4 


Jor K. Apams 
RonERT P. ABELSON 
Gorvon W. ALLPORT 
Justin M. AnoNrnEED 
Sotomon E. Ascu 
ALBERT BANDURA 
S. Howard BARTLEY 
DANIEL E. BERLYNE 
Dareir BINDRA 
ABRAHAM H. BLACK 
Epcar F. BORGATTA 
Everetr W. Bovarp 
Gorpon H. Bower 
Joser V. Brapy 
Jack W. BREHM 
Donatp M. BROVERMAN 
Rocer W. Brown 
Joser Brozex 
F. RosERT Brusu 
Rozert R. Busi 
Joun B. CARROLL 
Raymonp B. CATTELL 
Davin H. Grant 
RICHARD CHRISTIE 
RussELL M. CHURCH 
SipxNEv E, CLEVELAND 
Cryne H. Coowns 
LEE J. CRONBACH 
Kart M. DALLENBACH 
James DEESE 
WirtrAM N. DEMBER 
Cart P. Duncan 
J. A. EASTERBROOK 
Warp Epwarps 
James P. Ecan 
CnuanLES W. ERIKSEN 
WiLLiAM K. Estes 
Seymour FESHBACH 
Leon FrsTINGER 
Epwin A. FLEISHMAN 
Epwarp Foncv 
Joun Garro 
WENDELL R. GARNER 
FRANK A, GELDARD 
James J. Grmsox 
Kennetu P. Goopricu 
ALBERT E. Goss 
CLARENCE H. GRAHAM 
Davin A. Grant 
G. ROBERT Grice 
J. P. GurLronp 
HAROLD GULLIKSEN |. 
NORMAN GUTTMAN 


‘Egni 


Editorial Consultants 


Carvin S. HALL 

D. O. Hess 

R. J. HERRNSTEIN 
Ecknuanp H. Hess 
WixrnED F. Hirr 
HERBERT HOCHBERG 
Jurian HOCHBERG 
Lrovp G. HUMPHREYS 
J. McV. Hunt 

Leo M. Hurvicu 
Ray Hyman 
Francis W. Irwin 
James J. JENKINS 
Lyte V. Jones 
MICHAEL Kaun 
Leon J. KAMIN 
Howard H. KENDLER 
Tracy S. KENDLER 
WILLIAM KESSEN 
Leonard S. Kocan 
WOLFGANG KOHLER 
RavMoxp G. KunLEN 
WiLLIAM W. LAMBERT 
DoucLas Howarp LAWRENCE 
DANIEL S. LEHRMAN 
MARVIN LEVINE 

J. C. R. LICKLIDER 
GARDNER LINDZEY 

FRANK A. LOGAN 

ARDIE LUBIN 

R. Duncan Luce 

Davip T. LYKKEN 

CnA:IG MACANDREW 
Kennet MacConQvopALE 
WirLrAM J. McGrrr 

F. Joseren McGuican 
Doxarp W. MacKinnon 
Rozert B. MacLrop 
InviNG MALTZMAN 

GEORGE MANDLER 

SARNOFF A. MEDNICK 


. SAMUEL J. Mzssick 


WALTER MISCHEL 

O. Hopart MOWRER 
Conran G. MUELLER 
Bennet B. Murdock 
Crype E. Nose 
CHARLES E. Oscoon 
HELEN PEAK 

ALBERT PEPITONE 
Cuartes C. PERKINS, Jr. 
CARL PFAFFMANN 


LANCASTER PRESS, INC., LANCASTER, PA. 


Leo J. POSTMAN 
Marcom G. Preston 
Vicror Rarmy 

ORY RAZRAN 
EDWARD RENNER 
FRANK RESTLE ' 
Lorrin A. RicGcs 

Irvin Rock 

Cart R. ROGERS 
Mirrox E. ROSENBAUM 
FRANK RosENBLATT 
Davin L, RosENHAN 
Mark R, ROSENZWEIG 
Burton S. Rosnr v 
Witiam W. RodBloon 
GEORGE E, Rurr 
SEYMOUR B. Sarason 
STANLEY SCHACHTER 
HAROLD SCHLOSBERG 
Joun P. Sewarp 
Rocer N. SHEPARD 
Josera SHOBEN 
SIDNEY SIEGEL 
HERBERT A. SIMON 
Kart U. Smita 

M. BREWSTER SMITH 
Moncrierr H. Smrti 
KENNETH W, SPENCE 
Rocer W. Sperry 
ARTHUR W. STAATS 
LARRY STEIN 

Error STELLAR 

SAuL H. STERNBERG 
Joseru C. Stevens 
Patrick Supprs 
Jonn A. Swers > 
Renato Tacturt 
PHILIP TEITELBAUM 
Han-Luxas TEURER 
Warren S. Torcerson | 
Leona E. TYLER 
Benton J. Unprrwoop 
WILLIAM S, VERPLANCK - 
Victor H, Vroom | 
Epwarp L, WALKER 
SEYMOUR WAPNER 
Rozert W. WEITE 
DrLows D. WICKENS ~ 
Morton WIENER d 
Davi R. WinLiAMS -— 
Junius WIsHNER 
DAVID ZEAMAN 


| 
f 
if 
f 
(| 


CONTENTS OF VOLUME 69 


AMSEL, ABRAM. Frustrative Nonreward in Partial Reinforcement and Discrimination 


Learning: Some Recent History and a Theoretical Extension............... .. 306 
ANDERSON, Norman H. On the Quantification of Miller's Conflict Theory... 400 
BECKER, GonpoN M. Objective Measures of Subjective Probability and Utility....... 136 
BERGER, SEYMOUR M. Conditioning through Vicarious Instigation................--- 450 
Bonrav, C. ALAN. A Comparison of the Power of the U and ¢ Tests.............---- 246 
Bower, Gorvon H. An Association Model for Response and Training Variables in 

Paired-Associate Learning... i. oes saam er incerto RU sual ER nae Ie n 34 
Briccs, Micnaru H., AND G. Barrie Kirro. The Molecular Basis of Memory and 

LDYPe nr ON cT E A Gases cn uns ASRS ERE 537 
BnovERMAN, DonaLp M. Normative and Ipsative Measurement in Psychology........ 295 
CHAMPION, R. A. Stimulus-Intensity Effects in Response Evocation..........--.-+++ 428 
Cornsweet, Tom N. Changes in the Appearance of Stimuli of Very High Luminance. . 257 
Davis, JAMES H. See Restle, Frank 
DxrsE, JAMES. On the Structure of Associative Meaning... .... -55er n n 161 
Deutscn, J. A. A System for Shape Recognition. ....... se eee ee teres 492 
Epwarps, Warp. Subjective Probabilities Inferred from Decisions.........------++- 109 


EPSTEIN, ALAN N. See Teitelbaum, Philip. 
FRINCKE, GERALD. See Johnson, Ronald C. 
GacNÉ, Ropert M. The Acquisition of Knowledge 
Gipson, JAMES J. Observations on Active Touch... 0 n 477 
Grant, Davin A. Testing the Null Hypothesis and the Strategy and Tactics of In- 
vestigating Theoretical Models......... s tt tt eo: 
Gutick, W. L. See Smith W. M. 
HENLE, Mary. On the Relation between Logic and "Thinking 9e im Sein este afl 
Jonwsow, Roxan C. Reanalysis of *Meaningfulness and Verbal Learning" 
Jouxson, Ronatp C., TnowsoN, CALVIN W., AND FRINCKE, GERALD. A Response to 
Stricker’s Comments on “Word Values, Word Frequency, and Visual Duration 
OH Thresholds?..M oi. ccc sis ninne rtr mh hehehe e aera nm rhy 
Jones, Joan E. All-or-None Versus Incremental Learning 
Jones, Joan E. Contiguity and Reinforcement in Relation of CS-UCS Intervals in 


4 4 Classical Aversive Conditioning............... sees La EOM ER 176 
Jones, MansuaLL B. Practice as a Process of Simplification...............+++++++++ 274 
|. KENDLER, HOWARD H., AND KENDLER, Tracy S. Vertical and Horizontal Processes in 
-«Broblenuisol ving «us dein die viet gale Fra tenis id quels. sic e Hehe ARP gr eee 1 
KrNDLER, Tracy S. See Kendler, Howard H. 
Krnsnato, D. R., AND Narr, Jonn P. A Quantitative Theory of Feeling: 1960. ...... 17 
Krrro, G. BARRIE. See Briggs, Michael H. 
LoGaw, FRANK A. Conditional-Outcome Choice Behavior in Ratia siomi dice. eres 467 
Lucr, R. DUNCAN. Comments on Rozeboom's Criticism of “On the Possible Psycho- 


physical Laws... «e UE ME 
MANDLER, GEORGE. From Association to Structure ok 
J. Mr nick, SARNOFF À. The Associative Basis of the Creative Process 
—— Nare, Joun P. See Kenshalo, D. R. $ ae - 
A Dran. Two Components in Bipolar Scales: Direction and Extremeness..... 


iii 


iv "1 CONTENTS 


* 
Pruitt, Dean G. Pattern and Level of Risk in Gambling Decisions................. 187 
Pusots, JR., Benjamin H. An Application of Levine’s Model for Hypothesis Behavior 
I0 Seri Reversal DOBIBHE. use cers: cue eia ER: an ds italia ot diee Seg era vele Hs 241 
REsTLE, FRANK. The Selection of Strategies in Cue Learning....................... 2 329 


REsrLE, FRANK, AND Davis, James H. Success and Speed of Problem Solving by 


Indiyiduals:and/ Groups. i. us soa sce ven Dus ss TG Qe dpa Vises pera orisa nna ema vem eus ess BOO: 
RozEBOOM, WiLLIAM W. Comment. ......iiiiuuuuuuuuuuuusuu. T y 
RozEBOOM, WILLIAM W. The Untenability of Luce’s Principle 542 
RUBIN, Leonarp S. Patterns of Adrenergic-Cholinergic Imbalance in the Functional 

PSY CHOSE Bis xau inna tie mck dg Y RUN WAS Qu sic ssi at euros rtt es SRL at eric RE 15d 501 
SCHACHTER, STANLEY, AND SINGER, JEROME E. Cognitive, Social, and Physiological De- 

terminants of Emotional State...... 62.0000 s LLL LL s sun 379 


SINGER, JEROME E. See Schachter, Stanley. 
Smith, W. M., AND GuLICK, W. L. A Statistical Theory of Dynamic Contour Perception 91 
SOLOMON, RICHARD L., AND TURNER, LUCILLE H. Discriminative Classical Conditioning 


in Dogs Paralyzed by Curare Can Later Control Discriminative Avoidance Responses 
in the Normal State. i... n sassa sirs zi ee au ora LL REA Aran a 202 


i N. The Lateral Hypothalamic Syndrome: 
$ Recovery of Feeding and Drinking after Lateral Hypothalamic Lesions 
THomson, CaLvIN W. See Johnson, Ronald C. 

TuLvinG, ENDEL. Subjective Organization in Free Recall of “Unrelated” Words 
TURNER, LuciLLE H. See Solomon, Richard L. 


WurrE, SHELDON H. A Note on Intensity Generalization and Prothetic Scaling 


y 
1 


Vor. 69, No. 1 


January 1962 


PSYCHOLOGICAL REVIEW 


VERTICAL AND HORIZONTAL PROCESSES IN 
PROBLEM SOLVING * 


HOWARD H. KENDLER 


TRACY S. KENDLER 
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New York University 


The present paper is concerned with 
an approach—and not the approach— 
to the universally appealing but never- 
theless unpopular research area of 
problem solving. Problems of problem 
solving have proved to be particularly 
refractory to psychologists. More 
often than not the uncommon re- 
searcher with the temerity to attack 
some aspect of reasoning retreats to 
more secure and conventional problems 
when he discovers that his sorties fail 
to achieve any impressive victory. As 
a result the literature of problem solv- 
ing is almost chaotic because it is so 
heavily sprinkled with isolated bits of 
information (Duncan, 1959). 

Perhaps the present stage of de- 
velopment of psychology does not 
justify the strategy of investigating 
such a complex phenomenon. Fortu- 
nately, or not, science has no built-in 
traffic lights to inform investigators 
when to proceed. It may be a risky 
and potentially unfruitful gambit to 
investigate problem solving but then 


iAn earlier version of this paper was 
delivered by Howard H. Kendler as an in- 
vited address to the 1960 meeting of the 
Eastern Psychological Association, which 
was held in New York City. The authors 
are indebted to the Office of Naval Research 
and the National Science Foundation for 
their support of the research reported in 


this paper. 


Barnard College 


again it may not be. In addition .to 
the intrinsic interest of the area it does 
offer a challenge to those psychologists 
who are interested in testing the gen- 
erality of any set of theoretical prin- 
ciples stemming from other areas of 
behavior (e.g., learning, perception). 

This paper initially will make fleet- 
ing references to some methodological 
problems with which a researcher in 
the field of reasoning must contend. 
Then a simple pretheoretical model of 
problem solving will be described, fol- 
lowed by a report of research which 
the model generated, and which in turn 
is shaping the model itself. 


METHODOLOGICAL PROBLEMS IN PROB- 
LEM SOLVING RESEARCH 


Anybody who does research is—or 
should be—aware that every decision 
he makes cannot be justified by facts 
or logic. Some decisions must “be 
made on the basis of personal intuition. 
This is particularly true for the re- 
searcher in problem solving who must 
make three strategic decisions which 
cannot help but have profound influ- 
ences on his research and the ideas 
they generate (Kendler, 1961). These 
decisions, which are not completely in- 
dependent, are related to the place of 
problem solving in psychology, the use 
of complex or simple experimental 
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tasks, and the selection of a pretheo- 
retical model to guide research. Con- 
sidering the volitional nature of these 
problems, as well as the current status 
of psychological knowledge, it would 
be both inappropriate and erroneous to 
consider these methodological problems 
as offering only one sensible alterna- 
tive. Adopting this point of view 
would do much to minimize the need- 
less disputation that seems to peren- 
nially surround matters of research 
strategy. 

Accepting the principle that a basic 
research strategy is not simply an out- 
growth of logical and factual consid- 
erations does not reduce one to making 
decisions in either a haphazard or ran- 
dom manner. A given strategy can be 
adopted on the basis of rational con- 
siderations as long as it is realized that 
other reasonable attitudes might lead 
to the adoption of different decisions. 

The history of problem solving in 
particular and Psychology in general 
suggests that problem solving can best 
be conceptualized not as a basic psycho- 
logical process, but instead as one that 
reflects the interaction of more funda- 
mental processes (e.g., learning, per- 
ception, and motivation ). 

If problem Solving is not viewed as 
a unitary process, how is an appro- 
priate experimental situation selected 
to investigate it? One possibility is 
that a problem can be selected from a 
"true life" situation such as trouble- 
shooting electronic equipment. Or 
problems can be invented (Duncker, 
1945; Maier, 1930) that capture the 
flavor, if only partially, of problems 
we meet in everyday life. 

A more analytical approach can be 
taken to the selection of an experi- 
mental situation to investigate problem 
solving. If problem solving is com- 
pounded of elementary behavioral proc- 
esses, then it may be more strategic to 
devise some simple problems in which 


the relationships of fundamental psy- 
chological mechanisms to problem solv- 
ing are highlighted. That is, tasks 
should be devised not to duplicate or 
imitate everyday problems, but instead 
to isolate and magnify the basic mech- 
anisms that operate in such complex 
tasks. 

This analytical approach which is 
favored by the authors suffers from one 
major drawback. How is it possible 
to know the basic mechanisms of prob- 
lem solving prior to their discovery ? 
Obviously, excepting divination, there 
is no method. But this does not pre- 
vent the analytical approach from op- 
erating. The researcher can prejudge 
theoretical issues by formulating a 
model of what he guesses problem 
solving to be like. The model can 
guide the investigator in selecting the 
hypotheses to test, as well as the expe- 
rimental situations in which to test 
them. 

This brings us to the third and most 
important decision a problem solying 
researcher has to make: his choice of 
a pretheoretical model (Koch, 1959). 
A pretheoretical model is not equiva- 
lent to a theory. The criterion of 
validity cannot properly be applied to 
it because essentially a pretheoretical 
model is an informal conception that 
Operates as an analogy (Lachman, 
1960). It is conceivable that different 
models (e.g. learning, perception, in- 
formation theory) can all lead to fruit- 
ful and valid theories of problem solv- 
ing. 

Psychologists have many possibilities 
from which to choose their model. 
These models can be conveniently di- 
vided into two main categories: the 
empirical model that springs primarily 
from experimental data, and the for- 
mal model that is usually generated 
by mathematical or logical systems. 
Among the empirical models that have 
achieved some acceptance are those 
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that are based on introspective findings 
(e.g., the four successive stage model 
of “preparation,” "incubation," “inspi- 
ration,” and finally "verification"), 
the facts of perception, and those of 
learning. Some formal models used 
are those dependent upon stochastic 
models, game theory, and the operation 
of computers. 

The present authors adopted an S-R 
The 
lecision no doubt was influenced by 
professional training and past research 
efforts. But other considerations en- 
tered. For the past 4 decades S-R 
learning psychologists have probably 
been the most active experimental and 
theoretical group in psychology. To 
some, if not a large, extent this can be 
attributed to the fruitful and cleansing 
effect S-R language has upon design- 
ing, reporting, and interpreting re- 
search. S-R language forces the psy- 
chologist to focus his attention on ob- 
jectively defined environmental and 
behavior variables and thus encourages 
the collection of data and the testing of 
ideas. The efforts of S-R learning 
psychologists have supplied a host of 
facts, concepts, and hypotheses that 
can be exploited in an exploratory ex- 
cursion into the realm of problem 
solving. 

The facts and theories of learning, 
however, do not spontaneously coalesce 
to form a model that can guide re- 
search in problem solving. Some se- 
lection must be made. S-R learning 
theory does not represent a single or- 
ganized formulation. Anyone who is 
faiiliar with the systematic orienta- 
tions of Hull (1952), Guthrie (1952), 
Spence (1956), and Skinner (1953) is 
aware of this. Many of these syste- 
matic differences, however, become at- 
tenuated and some even disappear 
when viewed from the distance of prob- 
lem solving behavior. It is possible 
and perhaps even profitable to de- 


velop a learning model for problem 
solving that ignores many of the points 
of disagreement among S-R theories. 

Much of the objection to S-R lan- 
guage stems from the apparent dis- 
crepancy between active, flowing be- 
havior and the inert, static, single S-R 
association. Using S-R language does 
not mean that complex behavior ac- 
tually consists of S-R connections. 
After analyzing the concept of light, 
Toulmin (1953), concludes: “We do 
not find light atomized into individual 
rays: we represent it as consisting of 
such rays" (p. 29). Applying the 
same idea to the concept of the S-R 
association: “We do not find behavior 
atomized into individual S-R associa- 
tions: we represent it as consisting of 
such S-R associations." The concept 
of the S-R association, therefore, must 
be judged not in terms of its ability to 
provide a clear image of behavior, but 
rather in its capacity to represent the 
facts of behavior. 


PRETHEORETICAL MODEL or PROBLEM 
SOLVING 


An S-R model needs to represent 
two important characteristics of prob- 
lem solving behavior. These charac- 
teristics are behavior is continuous, and 
at any one time behavior consists of 
several habits. The terms “horizontal” 
and “vertical” are used to refer to these 
processes; horizontal to the continuity 
of behavior against the dimension of 
time, and vertical to the assumption 
that independent levels of behavior 
(i.e., S-R units) occur simultaneously. 

The assumption that S-R associa- 


tions do not occur in isolation, but 


instead are linked together to form 
integrated, continuous behavior goes 
back many years (e.g., Watson, 1913). 
Today the process is most commonly 
referred to as chaining. Skinner 
(1953) and his associates have de- 
veloped powerful techniques that shape 
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investigatory 


food salivation 


An S-R representation of 
classical conditioning. 


behavior into long, complicated chains. 
The mass of data they have collected 
suggests important principles govern- 
ing habit chaining. There is litle doubt 
that when their quasitheoretica] System 
is exploited fully with autoinstructional 
devices that important insights into 
problem solving behavior will emerge, 
particularly in relation to how an added 
bit of knowledge can trigger problem 
solution. The kind of chaining with 
which the Skinnerians have dealt (i.e., 
adding new S-R units to an already 
functioning chain) does not exhaust 
all the problems associated with the 
horizontal processes of problem solv- 
ing. Of particular importance to prob- 
lem solving is the spontaneous integra- 
tion of separate habits which occurs 
when an organism infers the conse- 
quences of combining previously inde- 
pendent S-R units, This kind of 
chaining was investigated in Kohler’s 
(1925) classical studies of insight and 
in the more controlled reasoning ex- 
periments of Maier (1930). More re- 
cently the authors (Kendler & Kendler, 
1956, 1961; Kendler, Kendler, Plis- 
koff, & D’Amato, 1958) have tried to 
identify some of the important vari- 
ables that enable children to combine 
Separate experiences in order to solve 
an inference-type problem. Much of 
the research reported in this paper will 
be concerned with how mediated stimu- 
lus and response events aid in the for- 
mation of problem solving chains. 
The assumption of vertical processes, 
i.e., the organism responds several dif- 


ferent ways at any one time, is also 
not a novel one. Every psychologist 
is aware that organisms make several 
different responses simultaneously, al- 
though typically only one is attended 
to. Sometimes the different responses 
are interrelated, as is the case between 
the heart and respiration rates of a 
fearful organism. In other cases the 
different responses are independent, 
€.g., à person's conversation is unin- 
fluenced by his tugging at his ear lobe. 
The best laboratory example of vertical 
processes, and one that has much rele- 
vance to problem solving, is shown in 
Figure 1. Those familiar with intro- 
ductory psychology textbooks will 
recognize this diagram as representing 
classical conditioning. Notice that the 
two solid lines indicate independent 
S-R units which are operating simul- 
taneously. One is the tone that ini- 
tiates the “investigatory” response, and 
the other is the food which elicits sali- 
vation. Initially these two associations 
operate in a parallel fashion, but as a 
result of their simultaneous occurrence 
an interaction takes place which is ex- 
pressed by the broken line representing 
the acquired conditioned response. 
Obviously the brief reference to hori- 
zontal and vertical processes in which 
it is assumed fundamental S-R prin- 
ciples operate (eg. discrimination: 
generalization, etc.) presents at best 
the barest skeleton of a model of prob- 
lem Solving. It needs the flesh a 
skin of experimental facts to give it 
solidity and theoretical principles t? 
clothe it in Scientific respectability: 
Let us now review some of the prog 


ress that has been made in this direc- 
tion. 


CONCEPT LEARNING AND UTILIZATION 


Although the primitive model just 
described fails to generate any research 
by itself, it does suggest that individual 
experiments cannot be directed at 
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problem solving in its entirety. There 
are too many aspects to this phenom- 
enon. The researcher, in designing an 
experiment, must scan the entire prob- 
lem solving process and then focus 
upon that segment that promises to 
yield fruitful results and is also amen- 
able to investigation. 

For reasons that will become evident, 
it was decided to compare reversal and 
nonreversal shifts in a simple concept 
learning task. Figure 2 characterizes 
each kind of shift by showing a simpli- 
fied version of an experimental situa- 
tion used with children. The stimuli 
(cups) for their first discrimination 
differed simultaneously on two dimen- 
sions (size and brightness). The sub- 
ject is rewarded for responses to one 
dimension (e.g., large cup is positive, 
small cup is negative). The other di- 
mension is irrelevant. After learning 
the first discrimination, the subject is 
forced to shift to another response. In 
a reversal shift the subject is required 
to respond to the same dimension on 
which he was originally trained, but his 
overt choice has to be reversed, e.g., 
he has to shift from a large cup to a 
small one. For a nonreversal shift the 
previously irrelevant dimension be- 
comes relevant, e.g. black becomes 
positive after large had been positive. 

Buss (1953) reported that college 
students executed a reversal shift more 
rapidly than a nonreversal shift. He 
attributed this superiority to the inter- 
mittent reinforcements that retard the 
progress of a nonreversal shift. For 
example, in Figure 2,? when a subject 


2The purpose of Figure 2 is to clarify 
the meaning of both a reversal and non- 
reversal shift. It would be misleading to 
believe that it represents exactly the meth- 
odology of “reversal-nonreversal” studies 
reported in this paper. For all experiments 
reported, except that of Buss (1953), de- 
signs were used that controlled for fortuitous 
intermittent reinforcements effects in a non- 
reversal shift. 


is making a nonreversal shift from 
large positive to black positive, he is 
reinforced when choosing the large 
black cup in preference to the small 
white cup. This fortuitous reinforce- 
ment of the choice of the large cup 
helps maintain the size discrimination 
and hence retards the learning of the 
brightness discrimination. The rever- 
sal shift group, on the other hand, re- 
ceives no reinforcement of the previ- 
ously correct responses, since they are 
10096 nonreinforced. 

This analysis is at best incomplete. 
The work of Kendler and Vineberg 
(1954) suggested that adult human 
concept learning cannot be represented 
adequately by a single-unit S-R theory 
in which the external stimulus is di- 
rectly connected to the overt response. 
Instead, a mediational mechanism (see 
Figure 3) is required which assumes 
that the external stimulus evokes an 
implicit response which produces an 
implicit cue that is connected to the 
overt response. 

It would be useful to digress for a 


First 
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Examples of a reversal and a 
nonreversal shiít. 
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Fic. 3. A schematic representation of the 
mediational hypothesis. 


moment to comment about the epis- 
temological status of these inferred 
stimulus and response events which are 
enclosed in the rectangle to emphasize 
their hypothetical character. Although 
not directly observable, they are “tied 
to" environmental and behavioral 
events. The basic assumption of the 
mediational hypothesis, at least for the 
time being, is that the implicit stimulus 
and response events obey the same 
principles that operate in observable 
S-R relationships. 

The mediational hypothesis has gen- 
erated confusion. Perhaps the follow- 
ing brief statements will clarify some 
possible areas of misunderstanding, 

1. The mediational hypothesis is 
neither new nor revoluntionary. Meyer 
(1911) and Watson (1913) referred 


to it, and Hull (1930) gave it a more 
formal status by coining the concept 
of the "pure stimulus act." Guthrie 
(1952) has always laid heavy stress 
on a mediational-type hypothesis when 
emphasizing the importance of proprio- 
ceptive stimulation in learning. 

2. The implicit stimulus and re- 
sponse events need not be conceived 
as having an existence independent of 
their relation to independent and 
dependent variables. These implicit 
events are theoretical constructs. Their 
epistemological status is closer to such 
concepts as drive and habit than to 
directly observable stimulus and re- 
sponse events, 

Some mediating events can conceiv- 
ably and probably will be coordinated 
to introspective reports, language be- 
havior, muscular movements, and other 
observable events. Coordinations of 
this sort can be useful in developing 
mediational theory. But such coordi- 
nations are not essential to mediational 
theory. The fact that genes are not 
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directly observable (at least according 
to the geneticists consulted) does not 
interfere with their theoretical and 
practical usefulness. Even if it were 
possible to observe a gene directly, it 
would be necessary to distinguish be- 
tween it as an observable entity and 
as a concept within a nomological net- 
work. It would be unwise, and strate- 
gically shortsighted, to identify media- 
tional events with introspective reports 
or language behavior, or other observ- 
able events. The “validity” of the 
mediational mechanism does not de- 
pend on being coordinated with ob- 
servable events, but depends instead on 
being utilized in a successful explana- 


tory system. 

Figure 4 characterizes reversal and 
nonreversal shifts in terms of both a 
single unit S-R analysis and a media- 
tional one? It would be predicted, 
according to a single unit hypothesis, 
that if fortuitous intermittent rein- 
forcements were eliminated from a 
nonreversal shift, it would occur more 


3 Figure 4 highlights the problem of what 
are the effective stimuli that are associated 
to the overt response in both a reversal and 
nonreversal shift. It is not intended to be 
a detailed analysis of which there may be 
several alternatives. For example, in a 
single unit theory the habit to choose the 
large container might result from learning 
two separate specific habits (e.g. the choice 
of a large black container when coupled 
with a small white one and the selection 
of a large white container when paired with 
a small black one). Another possibility, 
which would be consistent with Spence's 
theory (1936), is that the response is to 
the effective stimulus large since responses 
to the other features of the environment are 
not consistently reinforced. Similarly adult 
subjects in a reversal shift might use the 
mediator size or large or both. The effec- 
tive stimulus which is controlling the organ- 
ism's response must be determined by ex- 
perimentation. The point made here is that 
the general implications of the single unit 
and mediational theories, as discussed in this 
paper, would be the same for a number of 
different effective stimuli. 


- 


rapidly than a reversal shift. The 
reason for this is that at the time of 
the shift the difference between the 
strength of the dominant incorrect 
habit and the to-be-correct habit is 
much greater for the reversal, as com- 
pared to the nonreversal shiít. Con- 
sequently more training will be re- 
quired to make the correct habit domi- 
nant in a reversal shift. According to 
the mediational theory the situation is 
entirely different. A reversal shift 
enables the subject to utilize the same 
mediated response. Only the overt 
response has to be changed. A non- 
reversal shift, on the other hand, 
required the acquisition of a new 
mediated response, the cues of which 
have to be attached to a new overt 
response. Because the old mediational 
sequence has to be discarded and a 
new one formed, the nonreversal shift 
should be executed more slowly than 
a reversal shift. Thus, if it were pos- 
sible to eliminate fortuitous intermit- 
tent reinforcements, then the stage 
would be set for a crucial experiment 
testing the conflicting implications of 
the single-unit and mediational S-R 
theories. The results of a series of 
such crucial experiments (Buss, 1956; 
Harrow & Friedman, 1958; Kendler 
& D'Amato, 1955) have been con- 
sistent with the mediational formula- 
tion in showing that college students 
execute a reversal shift more rapidly 
than a nonreversal shift. It is im- 


4There are two possible ways of analyz- 
ing the superiority of a reversal shift over 
a nonreversal shift within an S-R media- 
tional framework. One is to simply count 
the number of new associations that have to 
be formed. As Figure 4 indicates only one 
new association has to be formed in a re- 
versal shift while two have to be formed 
for a nonreversal shift. Another possibility 
is that a mediating response is more difficult 
to extinguish than is an overt response. 
For the present the formulation can remain 
open-ended until information relevant to 
these two alternatives is gathered. 
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TABLE 1 
Mean NuMBER or TRIALS TO CRITERION ON 
Test DISCRIMINATION FOR SUBJECTS SCORING 
ABOVE AND BELOW THE MEDIAN ON THE 
TRAINING DISCRIMINATION 


Performance on training 


discrimination 
lr M ( 
pn Above Median Below Median 
(slow learners) (fast learners) 
Reversal 24.4 6.0 
Nonreversal 9.0 15.5 


portant to note that in a similar kind 
of problem rats find a nonreversal shift 
easier than a reversal shift (Kelleher, 
1956). Thus, one is forced to con- 
clude that a single unit S-R theory 
accurately represents the behavior of 
rats, while mediational S-R theory is 
required for the concept learning of 
articulate humans. 

The discontinuity between the be- 
havior of rats and college students 
directs one's attention toward the con- 
ditions responsible for the development 
of mediational processes, Somewhere 
on a hypothetical evolutionary dimen- 
Sion between the rat and college stu- 
dent there should be a point where a 
transition is made from a single unit 
to mediational control. An obvious 
place to locate this point would be in 
the behavior of young children, 

A study with kindergarten children 
(Kendler & Kendler, 1959) showed 
that these children as a group executed 
a reversal and nonreversal shift at ap- 
proximately the same rate. One might 
conclude that the point in human de- 
velopment was discovered which was 
psychologically halfway between the 
white rat and the college student, since 
the kindergarten children were neither 
responding in a single unit nor medi- 
ational manner, but instead in some 
compromise fashion. Another possi- 
bility is that the children had reached 
a transitional stage in development, in 
which the task to which they were 


subjected led some to function on a 
single unit basis, and others to operate 
with a mediational mechanism. If half 
of the subjects respond in each way, the 
total results would have revealed no 
difference between the two kinds of 
shifts. 

The second alternative seems to fit 
the data. When the kindergarten chil- 
dren were divided into fast and slow 
learners on the basis of their perform- 
ance in the first problem (training dis- 
crimination), slow learners performed 
during the second problem (test dis- 
crimination) according to the single 
unit theory ; like rats they found a non- 
reversal shift easier. Fast learners, on 
the other hand, performed in accord- 
ance with the mediational theory; like 
college students, they found a reversal 
shift easier. These results were inter- 
preted as demonstrating that these 
kindergartners, taken as a group, were 
in the process of developing mediating 
responses relevant to this task, and that 
Some were further along than others. 

If this interpretation be correct, then 
it would follow that for a group of 
younger (i.e, preschool) children a 
still smaller proportion should develop 
appropriate mediating responses. It 
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would be expected that such a group, 
taken as a whole, would show clearcut 
evidence of the superiority of a non- 
reversal over a reversal shift. An ex- 
periment (Kendler, Kendler, & Wells, 
1960) designed to test this hypothesis 
produced results consistent with this 
prediction; like rats, nursery school 
children found a nonreversal shift to be 
easier than a reversal shift. 

In a very recent study the experi- 
mental procedure was modified so that 
after learning the initial discrimination, 
the children of 3, 4, 6, 8, and 10 years 
of age who served as subjects, had a 
choice of either responding in a rever- 
sal or a nonreversal manner. Under 
such circumstances, it would be ex- 
pected that the proportion of children 
who respond in a reversal manner 
would increase with age. Figure 5 
shows that the percentage of children 
who chose a reversal shift rose grad- 
ually from 37.5 at 3 to 62.5 at 10. 

Generalizing from all of these re- 
sults, it would seem that in their early 
development, children tend to respond 
in a manner consistent with a single 
unit S-R theory. With age, they de- 
velop a tendency to respond in a media- 
tional manner. The last study cited 
suggests that it is, or will soon be, pos- 
sible to ascertain the lawful relation- 
ship governing the course of this de- 
velopment. 

The point of these experiments is not 
to classify children into one of two 
categories: rat-like or human-like. 
Their aim is to lay the groundwork for 
experiments designed to investigate 
the mediational process itself. If one 
wants to investigate mediational proc- 
esses, does it not seem sensible to scru- 
tinize them at the time when they are 
developing? Answering this question 
in the affirmative, it was decided to 
investigate the relationship between the 
hypothesized mediational processes and 
verbal behavior—a relationship every- 


body assumes to be intimate and im- 
portant. 

Particularly relevant to this attempt 
to coordinate verbalization with media- 
tion were observations that during the 
course of the experiments just de- 
scribed, it was not uncommon for chil- 
dren to verbalize spontaneously the 
correct solution while simultaneously 
making an incorrect choice. A few 
children did this for many consecutive 
trials. This observation is relevant to 
the concept of vertical processes. Two 
chains of habits are occurring simul- 
taneously. One has to do with verbal 
response; the other with the overt 
choice. For these children the two 
chains are parallel, that is, they do not 
interact. 

Luria (1957), the Russian psychol- 

ogist, made somewhat similar observa- 
tions in his research with children. He 
explains this sort of phenomenon in the 
following way : 
In the early stages of child development, 
speech is only a means of communication 
with adults and other children. . . . Subse- 
quently it becomes also a means whereby he 
organizes his own experience and regulates 
his own actions. So the child's activity is 
mediated through words (p. 116). 


These observations and their inter- 
pretations of noninteracting parallel 
processes point to the complex inter- 
relationships existing between verbal 
behavior on the one hand and problem 
solving on the other. If nothing else, 
they destroy the illusion that it is reas- 
onable to describe an organism as ver- 
bal or nonverbal without considering 
the problem with which it is confronted. 
The terms verbal and nonverbal be- 
come meaningful—and fruitful—when 
related to specific problem solving 
tasks. 

It would seem fruitful to investigate 
the cue function of words for children 
of two age levels. One possibility is 
that age influences problem solving 
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only in so far as it leads to the acqui- 
sition of words. If younger children, 
say 4 years of age, could acquire the 
same words as 7-year-olds, they would 
solve a simple concept-learning prob- 
lem the same way. The other possibil- 
ity is that the acquisition of the verbal 
label by itself is not sufficient ; the word 
must be integrated with other be- 
havioral chains to influence problem 
solving behavior. And for this to hap- 
pen some developmental changes must 
first take place, 

In order to test these two alterna- 
tives, children of 4 and 7 years of age 
were presented with another variation 
of the reversal shift problem as shown 
in Figure 6. They initially learned a 
simple discrimination between a pair 
of stimuli that varied simultaneously in 
size and brightness. In the illustration 
provided in Figure 6, the large black 
square is correct. While they were 
learning, the children were required to 
verbalize aloud the stimuli to which 
they were responding. One-third 
learned to say "large" (or "small" as 
the case may be) by the simple device 
of instructing them to tell the experi- 
menter which was correct, the large 
or the small one. Another third 
learned to say "black" (or “white’’) 


in a corresponding way. The remain- 
ing third was not required to say any- 
thing. After learning the discrimina- 
tion, all subjects were presented with 
a reversal shift. In the example de- 
picted in Figure 6, the shift is to small 
regardless of size. Thus, the group 
that initially described the correct 
stimulus as "large" had verbalized the 
relevant dimension. The verbal re- 
sponse of "black" was irrelevant to this 
reversal shift. 

Figure 7 shows the results of the 
three experimental groups for the two 
age levels. If developmental processes 
affect the utilization of verbal responses 
in problem solving, then it would be 
expected that the three verbalization 
conditions (which produced a signifi- 
cant main effect) would influence the 
behavior of the two age groups dif- 
ferently. These results suggest, but 
not quite at a significant level, that 
there is an interaction effect. Figure 
7 shows that the younger children 
profited by making the kind of verbal 
response appropriate to a reversal shift, 
while they were hindered by learning 
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inappropriate verbal responses. With 
no verbalization the 7-year-old chil- 
dren who presumably were responding 
largely in a mediational manner, ac- 
complished a reversal shift much more 
rapidly than their younger counter- 
parts. But unlike the 4-year-olds, they 
did not profit from being trained to 
make the relevant responses. At 7 
years of age they are capable of making 
the response themselves and outside 
help appears to be of little use. In 
contrast, the influence of irrelevant 
verbalizations is marked. The per- 
formance of the 7-year-olds was even 
poorer than that of the 4-year-olds, 
suggesting that the interfering effects 
of being given an inappropriate medi- 
ated response are greater when one is 
capable of spontaneously generating 
the correct one (7-year-olds) than 
when one is not (4-year-olds). 

How are these data to be explained? 
Attributing differences to developmen- 
tal factors is not sufficient. It is neces- 
sary to represent developmental differ- 
ences in terms of the concepts of the 
behavior model that is being used. 
That is, if a verbal label for a young 
child does not possess the same cue 
function as it does for an older child, 
then it becomes necessary to specify 
how and why this comes about. To 
some extent this has been done by em- 
phasizing the transition from a single 
unit to a mediational system, as well 
as suggesting that with age an increase 
occurs in interaction among chains of 
different vertical levels. But obviously 
this analysis of the developmental proc- 
ess demands further theoretical and 
empirical development. 

These studies are intimately related 
to the oft-reported finding that many 
species of subhuman animals are able 
to make a fairly rapid reversal shift 
if they receive a previous series of 
such shifts. Rats (Buytendijk, 1930; 
Krechevsky, 1932) show @ marked 


improvement in executing successive 
reversals. They finally reach a point 
(Dufort, Guttman, & Kimble, 1954), 
in a T maze, where they learn to go to 
a new rewarded goal after making only 
one error. Even more dramatic are 
the rapid discrimination reversals ex- 
hibited by Harlow's (1949) monkeys. 
But fish (Wodinsky & Bitterman, 
1957) exhibit only a slight improve- 
ment in successive reversals, while 
isopods (invertebrates) show no im- 
provement (Thompson, 1957). 

Because of the necessity to use some- 
what different experimental procedures 
for different species, it is difficult to 
draw an unqualified conclusion about 
the ability of different species to trans- 
fer what has been learned from previ- 
ous reversal shifts to a new one. But 
the suggestion is strong that as you 
ascend the evolutionary scale organ- 
isms acquire a greater capacity to gen- 
erate cues that enable them to make 
rapid reversal shifts. This behavior, 
according to our analysis, borders on 
the language responses of humans. 
The main difference is that our hu- 
man subjects, except those of a very 
young age, exhibit rapid reversals with- 
out any previous reversal training. 
Whereas the human automatically 
seems to generate a mediated response 
that provides the basis for his rapid 
reversal, the animal subject must grad- 
ually acquire an ability to respond ap- 
propriately to some response produced 
cue resulting from nonreinforcement 
of a previously correct response. 

Up to now, the reversal and non- 
reversal technique has been used to 
investigate mediational and develop- 
mental variables. It has proved suf- 
ficiently flexible to be used in a study 
(Kendler, Glucksberg, & Keston, 
1961) which was designed to lengthen 
a problem solving chain so that the 
interaction between various segments 
could be observed. In this study a 
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Fic. 8. The hypothesized behavioral chain operating at the time the subject was being 
shifted to the second concept. (Capital letters refer to directly observable stimulus and 
response events, while small letters refer to those that are inferred.) 


perceptual orienting S-R unit was 
added on to the mediational chain al- 
ready described. Figure 8 illustrates 
in an oversimplified manner the be- 
havioral sequence involved in this 
study in which subjects had to learn 
to press the correct button when two 
physically discrete and spatially sepa- 
rate stimulus patterns were projected 
on a screen at such a rapid rate that 
only one could be perceived on any 
trial. During the learning of each of 
two successive concepts (involving 
either a reversal or nonreversal shift), 
the subject had to pay attention to the 
relevant stimulus pattern while ignor- 
ing the irrelevant one. Thus, in order 
to make the correct overt response 
consistently, a subject initially had to 
make the appropriate orienting re- 
sponse in order to perceive the relevant 
stimulus pattern to which he had to 
make the correct mediational response 
which served as the cue for the key- 
pressing act. 

An experimental design was used in 
which, at the time of the shift from the 
first to the second concept, one group 
had already learned the appropriate 
orienting response as well as the ap- 
propriate mediating act. They needed 
only to learn a new terminal key- 
pressing response. The shift, for them, 
was easy to make. In contrast, the 
behavior of three other experimental 
groups was significantly worse. One 
group had to learn a new orienting 
response, e.g., look to the left instead 
of the right. Another group had to 
learn a new mediated response (i.e., 
they were required to make a non- 


reversal shift). The last group had to 
acquire both a new orienting and medi- 
ated response. The fact that the 
groups which were missing one or both 
of the necessary behavior units (orient- 
ing and mediated responses) did not 
differ significantly among themselves, 
as well as being much poorer than the 
group that had both, highlights the 
problem of synchronizing the S-R units 
in a behavioral chain. The advantage 
in this study of having one appropriate 
unit without the other is at best neg- 
ligible. The reason for this is that 
reinforcement is only achieved con- 
sistently when both the appropriate 
orienting and mediating responses are 
operating. This particular study points 
to the need for discovering laws asso- 
ciated with the strengthening and 
weakening of independent S-R units 
in a problem solving chain, as well as 
the principles governing their syn- 
chronization. 

This study also highlighted a very 
basic problem in all of these reversal 
studies. This problem has to do with 
the very first correct response follow- 
ing the reversal shift. After discover- 
ing that the previous mode of respond- 
ing is erroneous, what makes the 
subject change his response, i.e., push 
the button that was previously wrong! 
Introspective reports fail to provide 
any clearcut answer and even if the 
did they would be in need of explai?" 
ing (Kendler, 1961). " 

One hypothesis is that the selection 
of the new correct response is due tO 
the operation of a behavioral chain 1” 
addition to the one described in Figure 
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&. The first nonreinforcement in a re- 
versal shift sets off a chain, the conse- 
quence of which is to select the re- 
sponse other than the one that was 
previously correct. This may result 
from a number of different reasons 
(e.g., logical considerations, forgetting, 
etc.). The important point, however, 
is that the new key-sorting response 
occurs contiguously with the implicit 
mediational response appropriate to a 
reversal shift. As a result, a new as- 
sociation is formed between the old 
implicit cue and the new key-pressing 
response. 

In essence, what is being stated is 
that adult subjects, when making or 
deciding to make the first correct post- 
shift response, do not adopt the prin- 
ciple underlying a reversal shiít. In- 
stead, it is assumed processes are 
operating which encourage the selec- 
tion of the correct response while an 
implicit cue appropriate to a reversal 
shift is operating. This sort of an 
analysis was described previously 
(Kendler & Mayzner, 1956) as 
sort of a James-Lange theory of problem 
solving . .. one makes the overt correct... 
response and if the appropriate symbolic cue 


is present, then problem solution will occur 
(pp. 247-248). 


Guthrie (1959) says the same thing 
more neatly: “What is being noticed 
becomes a signal for what is being 
done” (p. 186). 

Again the authors would like to 
guard against giving the impression of 
oversimplifying a terribly complex 
problem. They do not believe the 
contiguous occurrence of an implicit 
cue from one chain with the correct 
overt response from another tells the 
whole story. This new association in 
order to persist must be reinforced and 
in some manner “fit into” the subject’s 
ongoing behavioral chains. — 

'The emphasis on this vertical con- 
nection between a cue and a response 


irom different chains is related in a 
distant way to Hebb’s (1958) stressing 
the role of “chance” in problem solv- 
ing: 

There are few scientists who have not had 
the experience of setting out to solve prob- 
lem A and ending up instead with the solu- 
tion to B. . . . This is serendipity, the art 


of finding one thing while looking for an- 
other (p. 215). 


According to the present analysis, 
serendipity results from the adventi- 
tious and contiguous occurrence of a 
cue and a response which are them- 
selves segments from different be- 
havior chains. Theoretically it should 
be possible. to demonstrate this point 
experimentally by training subjects to 
respond simultaneously to two sepa- 
rate tasks. A problem then would 
have to be presented that requires for 
its solution the combination of a stimu- 
lus from one chain with the response 
from the other. In such an experi- 
mental situation, controlling the time 
relationship between the two should 
have an important effect on problem 
solving. Presumably contiguity be- 
tween the two should provide the most 
optimal conditions for problem solving 
(Underwood, 1952). The develop- 
ment of this kind of experimental pro- 
cedure should allow for parametric 
studies of the basic variables of the 
phenomenon which has commonly been 
called “insight,” as well as throw light 
upon issues raised by others (eg. 
Cofer, 1957; Maltzman, 1955; Saug- 
stad, 1957). 

The pretheoretical model that guides 
the present research has many more 
facets that can be exploited. Only one 
will now be mentioned. Glucksberg 
(1962), for example, extended neo- 
behavioristic drive theory (Spence, 
1956) to problem solving. He used a 
functional-fixedness problem (Adam- 
son, 1952, Duncker, 1945) in which 
the correct response in the habit hier- 
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archy could either be made to be low 
or high. If the correct habit was low, 
it would be expected that a strong 
drive would retard problem solving 
because it would retard the extinction 
of the dominant incorrect response 
(Kendler & Lachman, 1958; Perin, 
1942). Since drive energizes behavior, 
a high drive should facilitate problem 
solving performance when the correct 
habit is dominant. The findings were 
consistent with this analysis. 

Because functional-fixedness prob- 
lems are often represented in percep- 
tual terms, Glucksberg was interested 
in seeing whether the same drive model 
could be applied to a simple perceptual 
recognition problem in which subjects 
were instructed to identify tachisto- 
scopically presented words as rapidly 
as possible. The results were similar 
to those reported for the functional- 
fixedness study: when the correct re- 
sponse was dominant, an increase in 
drive improved performance, i.e., the 
visual duration threshold was lowered. 
In contrast, increasing drive when the 
correct response was low in the hier- 
archy raised the threshold. 

There is obviously still much more 
work, both empirical and theoretical, 
needed to develop the model that has 
been described. At this point it may 
be appropriate to summarize the major 
points of this paper. 

There is not just one way to investi- 
gate problem solving. The researcher 
who is interested in problem solving 
has several different pretheoretical 
models from which to choose. This 
paper reported the results of a research 
program based on an S-R model in 
which the importance of horizontal and 
vertical processes were emphasized. 
Horizontal processes refer to the link- 
ing of successive S-R units into a 
behavioral chain, while vertical proc- 
esses refer to the assumption that in- 
dependent chains occur simultaneously. 


A series of experiments was reported, 
the implications of which supported 
postulating a mediational mechanism 
within a behavioral chain. 3y com- 
paring the behavior of human subjects 
of different ages, as well as relating 
their results to lower animals, it was 
possible to infer that as a child matures 
he makes a transition from responding 
on the basis of a single unit S-R mech- 
anism to a mediational one. Addi- 
tional data were cited that suggest the 
full impact of verbal behavior on prob- 
lem solving depends on developmental 
processes that encourage interaction 
between chains at different vertical 
levels. It was also suggested that 
problem solving begins in a simple 
concept learning task when a correct 
overt response from one behavioral 
chain occurs contiguously and adven- 
titiously with the appropriate implicit 
cue from another chain. The paper 
was concluded by citing findings that 
suggested the neobehavioristic drive 
theory which assumes that the effect 
of different levels of drive depends on 
the position of the correct response in 
the habit hierarchy is applicable to a 
functional fixedness problem as well as 
à perceptual-recognition task. 

If nothing else, it is hoped that the 
present paper demonstrates that it is 
possible to investigate problem solving 
in a systematic fashion. If more psy- 
chologists accepted this possibility and 
were willing to expend their research 
energies in the field of problem solving, 
progress in this area would be greater 
than it is today, 
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A QUANTITATIVE THEORY OF FEELING: 1960* 
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Florida State University 


In recent years considerable evi- 
dence has been presented which is 
contrary to the theory of von Frey 
(1895) that qualitative differences in 
cutaneous sensations are dependent 
upon the selective action of specialized, 
encapsulated end organs. Much of 
this evidence is from histological stud- 
ies of skin of various parts of the body 
and leads to the conclusion that en-, 
capsulated end organs cannot be found 
in hairy skin. Yet hairy skin exhibits 
sensitivity equal to or greater than that 
found in glabrous skin (Sinclair, Wed- 
dell, & Zander, 1952). 

This paper reviews studies con- 
cerned with the manner in which 
nerves terminate in skin and their role 
in cutaneous sensitivity. The histo- 
logical studies lead to the conclusion 
that nerve fibers supplying hairy skin 
terminate as bare, unencapsulated fila- 
ments in relation to three types of 
tissue: undifferentiated epithelial cells 
of the dermis and lower layers of the 
epidermis, hair follicles, and smooth 
muscle elements of the cutaneous blood 
vessels. Nafe (1934, 1942) has pro- 
posed that these bare filaments are es- 
sentially alike and hence respond to a 
common adequate stimulus, namely, 
movement either in relation to them- 


1 The first paper under this title was pub- 
lished by John P. Nafe in 1929. The present 
paper was presented in modified form at 
a symposium on Cutaneous Sensitivity at 
the Army Medical Research Laboratory, 
Fort Knox, Kentucky, February 11-13, 1960. 

Supported by Contract No. DA 49-007- 
MD-683 between the Office of The Surgeon 
General, Department of Army, and Florida 
State University. Reproduction 1n part or 
in whole is permitted for purposes of the 
United States Government. 
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selves or to the surrounding tissue. 
These terminals differ primarily in 
that they terminate in different tissues 
of the skin. Both tactile and thermal 
sensations result when tissue is moved, 
the former by the direct action of a 
mechanical stimulus on the tissue, and 
the latter, by movement of the ther- 
mally labile smooth muscle, especially 
of the arterioles. 


HisroLocy oF SKIN RECEPTORS 


Rose and Montcastle (1959) have 
reaffirmed the position of most text- 
book writers that the skin senses are 
mediated by specialized encapsulated 


endings. Commenting on the recent 
publications by the Oxford group 


( Weddell, Palmer, & Pallie, 1955), in 
which it is maintained that there is but 
a single type of nerve ending mediating 
cutaneous sensations, Rose and Mont- 
castle (1959) say: 


If a crisis exists in respect to evaluating 
the morphology of the endings, it is a crisis 
of abundance and not of scarcity. One hesi- 
tates to accept as a solution to the vexing 
problem of the morphology of the encap- 
sulated endings a declaration that virtually 
all morphological differences between them 
are either insignificant or due to artifacts 
of the technique (p. 390). 


If the Oxford group were the only in- 
vestigators to arrive at such conclu- 
sions, one might expect such a 
response as quoted above. The con- 
troversy as to whether or not encap- 
sulated endings mediate the various 
qualities of cutaneous sensations has 
a much longer history. 

Three major methods have been em- 
ployed in the attack on the problem of 
the histology of cutaneous sensation. 
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TABLE 1 


SUMMARY Or HISTOLOGICAL INVESTIGATIONS RELATING SPECIALIZED. ENCAPSULATED 
END OrGans with CUTANEOUS SENSATIONS 


Skin area 
Investigator 


Glabrous Hairy 


Encapsulated endings 


Sensations tested reported 


Type I Investigations 


Donaldson (1885) forearm 
Goldscheider (1886) forearm 
Hüggqvist (1913) forearm 
Dallenbach (1927) upper arm 
Pendleton (1929) forearm 
Woolard (1935) thigh 
Bazett (1941) forearm 
Weddell (1941) forearm 


Belonoschkin (1933) nipple 


cold none 

cold, warm, touch, pain | none 

cold, warm none 

cold, warm none 

cold none 

cold, pain, touch none 

cold none 

cold Krause end bulb 

cold complicated plexuses with- 


out capsules 


Type II Investigations 


von Frey (1895) conjunctiva cold Krause end bulb 
cornea cold, warm, touch, pain | none 
forearm cold Krause end bulb 
Strughold (1925) mouth mucosa cold Krause end bulb 
Strughold and Karbe (1925) | conjunctiva cold Krause end bulb 
Bazett et al. (1932) prepuce cold, warm, touch Krause end bulb, Ruffini 
$ N cylinders 
Sinclair et al. (1952) finger cold, warm, touch Meissner corpuscle, 
Merkel's discs, Krause 
and Ruffini endings 
forearm cold, warm, touch none 
auricle cold, warm, touch none 
Type III Investigations 
Gilbert (1929) thigh none 
epigastrium none 
fies none 
reast none 
Gilmer and Haythorn (1941) back none 
Hagen et al. (1953) abdomen none 
finger dorsum none 
Dastur (1955) ae gpeetmens from none 
: hairy skin 
Winkilmann (1955) hairy skin none 
Woolard (1936) thigh Meissner corpuscle 
" Merkel's disc 
Gilbert (1929) faot, ple Meissner corpuscles 1 
Golgi- i capsules 
Williams et al. (1929) prepuce ee arron ca 
Ruffini cylinders 
Meissner corpuscles 
1 4 unnamed types 
Cathcart et al. (1948) nipple PES end bulb (rare) ids 
, unnamed smaller caps. 
Gairns (1955), Gairns (1951) | palate Meissner corpuscles 
gum Krause end bulb 
y unnamed varieties 
Gairns (1953) tongue Krause end bulb 


eissner corpuscles 


The first involves careful sensory test- 
ing and permanent marking of _ the 
points on the skin which are sensitive 
to thermal and/or tactile stimulation. 
After sensory testing and marking, the 
tissue is removed, stained, and ex- 
amined microscopically. This is the 


most direct approach. The second 
method involves a determination ° 
the number and kind of sensory spot 
found in a skin area followed by, d 
histological examination of the tissue- 
The number and type of encapsulated 
end organs are related to the fre- 


Quantitative THeory oF FerriNG: 1960 19 


quency and kind of sensory spots, but 
no attempt is made to draw a close 
relationship between a spot and a par- 
ticular end organ as in the first method. 
The third only describes the general 
location and distribution of various 
kinds of encapsulated endings. The 
second and third methods give sup- 
portive data at best. 

Investigations of the first type have 
been carried out at least nine times 
since the first report of Donaldson 
(1885). With the exception of the 
study reported by — Belonoschkin 
(1933), these investigations have been 
carried out on hairy skin. It will be 
noted in Table 1, with one exception, 
that the investigators have failed to 
find encapsulated endings of any sort 
beneath the spots of stimulation. In- 
vestigations of the second type have 
been conducted primarily on special- 
ized tissue, such as is found in the 
conjunctiva, the prepuce, and the 
palmer side of the finger. While there 
is no direct correlation between a sen- 
sory spot and the type of ending be- 
neath it, in general, these investigators 
conclude that the number and type of 
encapsulated endings correspond 
rather well with the number and type 
of sensory spots reported in that tissue. 
The list of investigations of the third 
type included here is by no means ex- 
haustive. The inclusion of these au- 
thors is on the basis of their interest 
in cutaneous sensation. It will be 
noted that, with one exception, when 
specimens of skin were removed from 
the hairy regions of the body, encap- 
sulated endings were not found; 
whereas, when the specimens of skin 
were obtained from the nonhairy por- 
tions of the body, encapsulated endings 
were quite prevalent. — m 

In this entire list of investigations 
there are three exceptions to the gen- 
eral conclusion that encapsulated end- 
ings do not occur in hairy skin. These 


are Weddell (1941), in the first type 
of investigation; von Frey (1895), in 
the second type of investigation; and 
Woolard (1936), in the third type of 
investigation. Weddell et al. (1955) 
now contend that the structures which 
he and Woolard (1936) reported as 
being encapsulated endings were either 
unusual sections of hair follicles, or 
artifacts which resulted from rough 
handling of the specimens during the 
fixing process. This appears to be 
reasonable in view of the results ob- 
tained by the other investigators. The 
conclusion is inescapable that encap- 
sulated endings do not occur in hairy 
skin. They exist only in specialized 
skin areas as the conjunctiva of the 
eye, the mucous membrane of the 
mouth, the palmer surfaces of the fin- 
gers, and the palms of the hand, the 
soles of the feet, and the genitalia. In 
hairy skin nerves terminate as bare 
filaments. The manner of termination 
of these afferent nerve fibers has been 
investigated by Weddell, Pallie, and 
Palmer (1954), and the work of other 
investigators has been reviewed by 
Weddell et al. (1955). In general, 
afferent nerves terminate as bare fila- 
ments in relation to three types of 
structures found in the skin. These 
are: in and among the cells of the stra- 
tum granulosum of the epidermis and 
among the cells of the dermis; some of 
the largest fibers serving the skin end 
in relation to hair follicles, as filaments 
entwined about the hair shaft; and 
other smaller fibers terminate in rela- 
tion to the smooth muscle cells of the 
cutaneous blood vessels. 

Punctate sensitivity of the skin, as 
described by Blix (1884) and Gold- 
scheider (1884) is often cited as sup- 
porting evidence for the specialized en- 
capsulated receptor theory. Some 
areas or spots of skin are, indeed, more 
sensitive than others when near thresh- 
old intensities of stimulation are used. 
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However, in studies in which intensity 
of tactile (Guilford & Lovewell, 1936) 
or thermal (Jenkins, 1940) stimuli 
were systematically changed, the num- 
ber of sensitive spots varied directly 
with the intensity of the stimuli. It 
appears that the skin is more or less 
continuously sensitive to tactile and 
thermal stimuli, exhibiting peaks and 
valleys of sensitivity. 


CUTANEOUS SENSATION 


It is no longer profitable to hypothe- 
size encapsulated end organs which 
are sensitive to specific types of energy 
applied to the skin. Another organiz- 
ing principle must be found to account 
for qualitative differences in sensations 
derived from cutaneous stimulation. 
The filaments in which these nerves 
terminate appear morphologically alike, 
although it cannot be maintained that 
they are identical (Sperry, 1950; 
Weiss, 1947). On the other hand, 
there seems to be little reason to as- 
sume that the differences are suffi- 
ciently great to account for the wide 
variety of sensory experiences derived 
from their stimulation. Since there is 
no reason, at present, to assume that 
they are different, it would be expected 
that the terminals of these cutaneous 
fibers would share a common adequate 
stimulus, regardless of the tissue in 
which they end. Evidence has been 
presented (Nafe & Kenshalo, 1958; 
Nafe & Wagoner, 1941a, 1941b) which 
indicates that movement is this com- 
mon adequate stimulus. Differences in 
the qualities of cutaneous sensations 
are based upon differences in the prop- 
erties of the tissue innervated. Tactile 
sensations are aroused by and during 
movement of dermal and epidermal 
tissue. Thermal sensations are also 
aroused by tissue movement, although 
in this instance, the tissue is the smooth 
muscle composing the walls of the 


cutaneous vascular (Nafe, 


1942). 


system 


The Tactile Sense 


Two components exist in the appli- 
cation of any mechanical stimulus to 
the skin. These are a static com- 
ponent (or magnitude of displace- 
ment), and a dynamic component (or 
velocity and acceleration). Accelera- 
tion has not been well investigated, 
although other work (Hubbard, 1958) 
seems to indicate that it is of little sig- 
nificance. 

The earlier theories concerned with 
the nature of the adequate stimulus 
for the tactile sense considered the 
static component of the mechanical 
stimulus to be the relevant aspect of 
that stimulus; thus, there is the pres- 
sure theory (Weber, 1846), the gradi- 
ent theory (Meissner, 1859), and the 
tension theory (von Frey & Kiesow, 
1899). von Frey and Kiesow formu- 
lated their theory on the basis of stim- 
uli involving very small areas; that 
is, stimuli with radii of less than 0.1 
millimeter. They found that tension 
(weight per unit radius) was related 
to the threshold of sensation for these 
very small stimuli. On the other hand, 
for larger stimuli, greater than 0.4- 
millimeter radius, they were able to 
relate only the rate of application of 
the stimulus to the threshold of sensa- 
tion. Apparently, von Frey and Kie- 
Sow were so convinced of the impor- 
tance of the static component of the 
mechanical stimulus that they ignored 
the rate factor involved in the stimuli 
of larger radii, 

Nafe and Wagoner (1941a, 1941b) 
were the first to record the motion ? 
à mechanical stimulus in producing ^ 
tactile sensation. Among the recorded 
variables of weight, area, time to cess4- 
tion of the sensation, and the rate O 
Stimulus movement at the cessation © 
the sensation, a relationship was es- 
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tablished between weight and the rate 
of stimulator movement at the time 
of cessation of the tactile sensation. 
The rate of movement varied inde- 
pendently of the area at the time of 
cessation of sensation; hence, any as- 
pect of the stimulus which involves this 
variable, e.g., pressure or tension, must 
be considered to be of no importance 
in producing and maintaining a tactile 
sensation. In addition, they found that 
depth of deformation of the tissue was 
a function of the applied pressure and 
must also be rejected as the stimulus. 
In other words, they were not able to 
relate any of the static aspects of the 
stimulus to the point at which cessation 
of the sensation occurred. As a result, 
they concluded that the dynamic com- 
ponent was the relevant aspect of the 
stimulus; further, that cessation of the 
tactile sensation was the result of fail- 
ure of the dynamic component of the 
mechanical event to stimulate, rather 
than failure of the receptor to respond 
to the stimulus. The rate of tissue 
deformation had reached a subliminal 
value. 

To test this notion, Nafe and Wag- 
oner (1941b) lowered a weight of 
17.5 grams onto the skin is 8.75-gram 
increments. The subject reported a 
tactile sensation with the application of 
each increment and cessation of the 
sensation when the rate of stimulus 
movement became small. Further- 
more, when the second 8.75-gram 
weight was removed, the subject again 
felt a tactile sensation which disap- 
peared at approximately the same rate 
of tissue movement as before. They 
contend that had the receptors, rather 
than stimulation, failed, the subject 
would not have felt the application of 
the second weight, nor would he have 
felt its removal. 

In order to locate this effect in the 
peripheral receptor mechanism, rather 
than in the central nervous system, 
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STIMULUS EVENT 


a 


NEURAL EVENT 


DISPLACEMENT 


a 


VELOCITY 

Fic. 1. Hypothetical response in a nerve 
fiber to a stimulus. (If the tactile receptor 
responds to the static component of a stimu- 
lus, action potentials should occur through- 
out stimulation as shown in the second line 
labeled “Displacement.” The neural re- 
ponse to the dynamic component of the 
stimulus, in the third line, will last only 
during movement of the stimulus.) 


Nafe and Kenshalo (1958) repeated 
Nafe and Wagoner’s experiments us- 
ing a slightly modified version of the 
Nafe and Wagoner apparatus. In 
place of the human observer, an ani- 
mal, usually a rat, was used. Instead 
of a verbal report by the subject, action 
potentials were recorded from the 
nerve innervating the stimulated tis- 
sue. Simultaneous recordings of the 
stimulus action and the neural dis- 
charge were made on photographic 
paper. In most of the preparations, 
the tongue of the rat was stimulated 
and recordings were made from the 
lingual branch of the fifth cranial 
nerve. Some preparations were made 
of the shaved leg of the rat and records 
were made from the femoral nerve, 
while in other experiments, the skin 
of frog was used. 

Figure 1 shows the type of record 
which would be predicted if the recep- 
tors of the skin respond to either the 
static component (displacement) or 
the dynamic component (velocity) of 
the mechanical stimulus, 

Figure 2 shows a record of the stim- 
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Fic. 2. Action potentials in the lingual 
nerve in response to weight applied to the 
tongue of rat. (Movement of the weight 
into the tissue is shown in the top line. This 
is comparable to only the left half of Figure 

In other experiments the weight was 
later lifted, Action Potentials were also 
obtained during its removal—modified from 
Nafe & Kenshalo, 1958.) 


ulus resulting from an 8.75-gram 
weight sinking into the tissue of rat 
tongue, and the neural discharge 
which results, In this type of re- 
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cording, the stimulator follows the ad- 
justment of the tissue to the weight 
rather closely. It can be seen that 
the neural discharge which accom- 
panied application of the stimulator 
Stopped before the tissue had com- 
pletely adjusted to the added weight. 
The stimulator was still moving into 
the tissue, in this instance at the rate 
of 0.019 mm/sec, when the neural dis- 
charge stopped. Records such as these 
establish the point that the cessation of 
Sensation associated with a minimal 
rate of movement is a peripheral 
event. 

The question still remains as to 
Whether the cessation of discharge is 
à function of the stimulus effectiveness 
alone, or if it is in part due to adapta- 
tion by the receptor. To test this, the 
stimulus was hydraulically driven into 
the tissue in two steps of equal veloc- 
ity, but the second step was of less 
magnitude than the first. If the cessa- 
tion of discharge were associated with 
receptor adaptation rather than a stim- 
ulus Parameter, the neural discharge 
associated with the second step should 
be less than that of the first. On the 
other hand, if rate of movement is the 
determining condition, the discharge 
to the second Step should be equal to 


hydraulically into the tongue tissue in two steps of equal velocity, 


LINGUAL NERVE —RAT 
4 4-26-55 
Fic. 3. Action potentials in the lingual nerve of rat when the stimulator is driven 


(The course of the 


stimulus into the tissue is shown in the top line—modified from Nafe & Kenshalo, 1958.) 
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Fic. 4. Hypothetical cross section through skin showing mechanoreceptors at À and B 
in relation to the stimulator. 


that from the first. 
As can be seen from Figure 3, the 
second hypothesis is confirmed. The 
longer discharge associated with the 
second step is explained by greater 
mechanical stress in the tissue follow- 
ing the application of the first step, 
than prior to its application. Thus, in 
Figure 4, if it is assumed that the ter- 
minals of the fiber from which the rec- 
ord was made are at A, it is apparent 
that movement of tissue at A will be 
greater per unit movement of the stim- 
ulator when A is under mechanical 
stress than when it is not. The ob- 
servations of Loewenstein (1956) 
emphasize the importance of the me- 
chanical factors involved in the stimu- 
lation of skin receptors. He has re- 
ported that the duration of a single 
unit discharge in response to a stand- 
ard stimulus in frog skin is greater 
when the skin is placed under me- 
chanical stress than when it is flaccid. 
Our own observations of a decrease in 
the velocity of stimulus movement 
with increasing weights applied over 
a constant area, may be explained in a 
similar fashion. One might think of 
the relation of the stimulus and recep- 
tor as analogous to a mechanical gear 


or greater than 


train with a large amount of backlash. 
As the gears are put under mechanical 
stress, by loading, e.g, the use of 
spring loaded antibacklash gears, the 
output of the gear train follows that 
of the input more closely. Similarly 
the tissue at A follows movements of 
the stimulator more closely. 

To test these notions it seemed ap- 
propriate to use an unyielding tissue. 
The coupling between the stimulator 
and receptor would be closer and tissue 
readjustments to the stimulus action 
would be minimal compared to the 
rat’s tongue. Frog skin is suited for 
such an investigation. As can be seen 
in Figure 5, the neural discharge oc- 
curred only during the time that the 
tissue was being moved. The second 
interesting aspect is that when the 
stimulator was removed from the 
frog’s back a third series of neural 
impulses occurred. 

Movement of tissue appears to be 
the adequate stimulus for nerve termi- 
nals found in a variety of locations. 
Nafe and Wagoner (1941b) used the 
skin of the web between the thumb 
and index finger and just above the 
knee. Nafe and Kenshalo (1958) 
used the tongue and hind leg of rat, 
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Fic. 5. Action potentials in the dorsal 
cutaneous nerve of frog as a result of a 
stimulus driven hydraulically into the skin 
and then removed. (The course of the 
stimulus into the tissue is shown in the 
top line—modified from Nafe & Kenshalo, 
1958.) 


and frog skin. There appears to be 
little difference in the mechanism of 
stimulation of the nerve terminals 
which exist in these various locations. 
Differences which have been observed 
may be accounted for by differences in 
the mechanical properties of the tissue 
surrounding them. It has been known 
for some time that nerve terminals 
surrounding hair follicles respond only 
to movement of the hair (Adrian, 
1930). In addition, the works of Gray 
and Malcolm (1950), Gray and 
Matthews (1951), and Gray and Sato 
(1953) have demonstrated that the 
Pacinian corpuscle is also responsive 
to only the dynamic aspect of the 
stimulus—movement. 

According to this concept, tactile 
adaptation, in the sense of a protracted 
but diminishing sensation or nerve dis- 
charge resulting from a constant stimu- 
lus, becomes meaningless. Consider 
the diagram of Figure 4. The rate of 
lissue movement or adjustment at A 
will be much less than the rate of 
movement of the stimulator, but as it 
continues to sink into the tissue, the 


rate of tissue movement will approach 
that of the stimulator. If it is as- 
sumed that a receptor is at A, it is 
clear that the stimulus to which the 
receptor responds is by no means con- 
stant, although the stimulus object, 
which the experimenter Sees, is con- 
stant—it weighs 5 grams, or exerts 5 
grams/cm? pressure. If a solid plate 
such as bone is placed immediately 
beneath the diagram, shearing forces 
are also produced because the tissue 
will be squeezed between the stimu- 
lator and the bone. The result of this 
added complication will be that reso- 
lution of these stresses will require 
much longer at A than at D. Ac- 
cording to the older electrophysiolog- 
ical literature, in comparing the neural 
discharges from receptors at À and B, 
it would have been said that A was 
slow adapting while B was fast adapt- 
ing, implying a difference in the neuro- 
logical functioning of each rather than 
in tissue mechanics. An important 
parameter of the problem is overlooked 
if the mechanical properties of the tis- 
Sue are not also investigated, 


The Thermal Sense 


Before considering proposals for 
thermal receptors, the facts concerning 
thermal sensation should be examined, 
They impose rather stringent require- 
ments upon any theory concerned with 
the nature of thermal reception. The 
thermal sense is different from other 
Senses. In the other senses absence 
Of sensation is associated with near 
zero energy levels. Zero thermal en- 
ergy is at absolute zero, approximately 
—273* C: yet. psychological zero or 
zero thermal sensation is at skin tem- 
perature—approximately 33° 52 d 

Stimulation of thermal receptors 7i 
Pends upon a temporal thermal gradi- 
ent. Within limits, thermal sensations 
occur only when the stimulus tem- 
perature change occurs at above a 


v 


| 
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minimal rate. Skin temperature is 
known to shift spontaneously by as 
much as 0.5° within the space of a few 
minutes (DuBois, 1941); yet during 
these shifts the subject is not aware 
of any thermal sensation. The ob- 
servations of Hensel (1950) indicate 
that skin temperature may be shifted 
within the range of 29°-36° C with- 
out producing a thermal sensation, 
provided the change is at a rate of 
less than 0.007? C/sec. This appears 
to be the zone of complete thermal 
adaptation and hence specifies the 
limits of the shifting psychological 
zero. The shifting zero point accounts 
for the frequent observation that the 
thermal reception system in man is 
not a good absolute thermometer. 
While thermal sensations do not occur 
with large changes in skin temperature 
at a slow rate, threshold sensations oc- 
cur at a rate of 0.001? C/sec for 3 
seconds (Hardy & Oppel, 1936). 

The thermal threshold is affected by 
the temperature of the skin ( Hensel, 
1950). Ebaugh and Thauer (1950) 
have reported that the cool threshold 
is increased at skin temperatures above 
33? C. Kenshalo, Nafe, and Dawson 
(1960) confirmed their findings and, 
in addition, were able to show that the 
warm threshold decreases at skin tem- 
peratures above 34? C. 

Theories which depend on a spatial 
thermal gradient must be rejected. 
Ebbecke (1917) and Bazett and Mc- 
Glone (1932a) have suggested spatial 
gradient theories. The notions of 
Bernhard and Granit (1946) and von 
Euler (1947), that nerve itself may 
a thermal receptor, is also based 


act as 
on regional cooling of the nerve. Lele, 
Weddell, and Williams (1954) have 


extended this notion and suggested a 
Bolometer Theory which depends on 
warming and cooling of ame. in 
relation to their stem axons. € 
s . . e S 
tion of spatial gradients as 4 mea 


stimulating thermal receptors requires 
some justification. Hensel and Zotter- 
man (1951c), using electrophysiolog- 
ical techniques, located the termination 
of a fiber responding to cold close to 
one side of cat's tongue. They found 
that the receptor performs equally well 
regardless of which side of the tongue 
was cooled. The older work of Bazett, 
McGlone, Williams, and Lufkin 
(1932) on the prepuce, likewise, dem- 
onstrates the ineffectiveness of the 
direction of the spatial thermal gradi- 
ent as a means of stimulating warm 
receptors. Vendrik and Vos (1958) 
abolished a spatial thermal gradient 
altogether by warming the skin with 
a radar beam of 10-centimeter wave 
length, yet were unable to observe any 
appreciable difference in the warmth 
threshold from that obtained by infra- 
red radiation. Electromagnetic wave 
lengths of this order are quite pene- 
trating, according to them, and warm 
the deeper tissues as rapidly as the 
skin surface. If a spatial thermal gra- 
dient were necessary to stimulate the 
thermal sensing mechanism, they 
should have obtained a much higher 
thermal threshold using the radar 
beam than by infrared stimulation. 
The latter raises the skin surface tem- 
perature and depends upon conduction 
of the thermal energy to the deeper 
tissues. 

These experiments demonstrate the 
ineffectiveness of spatial thermal gradi- 
ents in depth as a stimulus for the 
thermal sensing mechanism. It is still 
possible that gradients across the sur- 
face of the skin may be of some im- 
portance, although it seems unlikely. 
Experiments such as those by Hardy 
and Oppel (1936) in which large 
areas of skin, e.g., the torso or torso 
plus face, were warmed uniformly by 
radiant energy should have produced 
thermal sensations only at the edges 
of the stimulus. Their subjects made 
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no such observation. Similarly, the 
thermal sensation aroused by laying a 
cold coin on the skin should occur only 
at the margin, but this is not consistent 
with common observation. 

Two types of receptor mechanisms 
have been proposed for mediating 
thermal sensations. One assumes that 
cutaneous nerve terminals are differ- 
entially sensitive, some responding 
only to mechanical energy, while 
others respond to thermal energy. 
The other maintains that endings of 
the skin respond to a common ade- 
quate stimulus—movement. Thermal 
reception would then be by means of 
tissue movement resulting from ther- 
mal stimulation. It is well known 
that smooth muscle is thermally re- 
sponsive, contracting when cooled and 
relaxing when warmed. Such move- 
ment induced in smooth muscle of the 
cutaneous vascular system would pro- 
duce impulses in the nerve fibers ter- 
minating among these smooth muscle 
elements. 

The notion that nerve terminals re- 
spond differentially to mechanical and 
thermal energy is supported by evi- 
dence from Hensel and Zotterman 
(1951b). "They found that stem axons 
which show activity in response to the 
application of mechanical stimuli to 
their terminals do not respond well to 
thermal stimuli and vice versa. Their 
findings indicate that mechanically 
sensitive nerve terminals have stem 
axons of larger diameter (larger than 
8 micra); whereas, those responding 
to thermal stimuli are smaller (1.5-5 

micra). They interpret their findings 
as indicating that nerve terminals are 
inherently different in their respon- 
siveness to thermal and mechanical 
energy. However, such differences 
could occur as a result of differences 
in the mechanical and thermal prop- 
erties of the tissue in which the nerve 


terminals end. 


Several mechanisms have been sug- 
gested to account for the apparent dif- 
ferential sensitivity of nerve terminals. 
Bishop (1946) has related it to charac- 
teristics associated with the size of 
the stem fiber involved. Others have 
suggested that the nerve terminals 
differ in their sensitivity to chemical 
changes. Bazett and | McGlone 
(1932b) proposed that thermal sensa- 
tions might be the result of alterations 
in blood acidity produced by warming 
and cooling. Consistent ‘with this, 
Dodt (1956) has demonstrated a dif- 
ferential effect of CO, tension on the 
response of nerves stimulated ther- 
mally. In his Concentration Theory, 
Jenkins (1941) has also suggested a 
chemical intermediary. Bare nerve 
endings, according to Jenkins, would 
discriminate between a catabolic and 
an anabolic phase of some chemical 
reaction brought about by a change in 
temperature. Presumably bare nerve 
endings responding to mechanical 
stimuli would be insensitive to these 
changes. 

The suggestion of Hensel and Zot- 
terman (1951b) on the mechanism of 
the stimulation of cold receptors is 
reminiscent of the anabolic-catabolic 
hypothesis. Because of the rather 
complex shifts in the frequencies of 
impulses which they found in fibers 
responding to a temperature reduction, 
they suggest that there are two tem- 
perature dependent processes, prob- 
ably chemical, involved in the stimula- 
tion of cold fibers (Zotterman, 1959). 
The impulse frequency resulting from 
cold stimulation is a function of a dif- 
ference in the activity of these two 
processes. Although they do not spec- 
ulate on the neural response to warm- 
ing, one must presume at least one, and 
probably two, additional processes are 
necessary. Dodt (1953) has shown 
that the responses of mammalian A 
fibers to warming and cooling are 
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functions of their diameter. The dif- 
ferences do not appear great enough, 
however, to aid in understanding Hen- 
sel and Zotterman’s suggestion of a 
chemical process controlling the re- 
sponsiveness of certain fibers to ther- 
mal stimuli. 

Hypotheses which attempt to ex- 
plain differences between thermal and 
mechanical sensitivity on the basis of 
nerve terminals of different sensitivity 
must assume two different sets of cu- 
taneous nerve terminals, one set re- 
sponding to a mechanical form of 
energy, the other to thermal energy. 
If the difference between the two sets 
of terminals is a chemical one, the 
chemical reaction should be one which 
is in equilibrium at the mid-range of 
the thermal energy continuum, which 
may have its thermal equilibrium point 
shifted by 6 or 7 degrees if done suf- 
ficiently slowly, yet which will become 
unstable when changed rapidly by as 
little as 0.003° C (Hardy & Oppel, 
1936). To further complicate the 
theory, Hensel and Zotterman (1951b) 
find that of the terminals responding 
to thermal energy, some are activated 
only by an elevation, while others re- 
spond to a reduction in the thermal 
energy. Thermal fibers must now be 
endowed with even greater selectivity, 
either to discriminate between two 
different chemical reactions, one for 
warming and one for cooling, or be- 
tween the directions in which a re- 
versible chemical reaction is driven. 
In addition, the response of terminals 
to cooling is to be explained by the 
interacting of two temperature depen- 
dent processes, according to Zotter- 
man (1959). í 

An alternative, involving fewer as- 
sumptions, is that nerve fibers and 
their terminals are largely alike; fur- 


ther that the differentiation necessary 
to account for thermal and mechanical 
direct result 


responsiveness occurs as à 


of the tissue in which the fibers termi- 
nate. It has been demonstrated that 
nerve terminals, similar to those ter- 
minating among the cells of the epi- 
dermis, end among the smooth muscle 
fibers of the cutaneous vascular net- 
work (Weddel et al., 1954; Woolard, 
1926). Smooth muscle is known to be 
thermally responsive, relaxing upon 
warming and constricting when cooled. 
Furthermore, Nafe and Kenshalo 
(1958) have demonstrated that move- 
ment of the nerve terminal either upon 
itself or in relation to its surrounding 
tissue best fits the facts concerning 
mechanical reception. 

Goldscheider (1886) was probably 
the first to suggest a relationship be- 
tween the cutaneous vascular system 
and thermal sensation. He noted in 
his biopsies of temperature sensitive 
spots the “striking and immediate 
proximity" of blood vessels to cold 
spots which he has previously mapped. 
Elaborating on this, Nafe (1934) 
pointed out the striking relationship 
between changes in the smooth muscle 
response to temperature and the sub- 
jective reports of stimulation by the 
same temperatures. The figures which 
Nafe (1934) cited applied to smooth 
muscle of the uterus and while they 
apply to smooth muscle in general, re- 
cent evidence indicates that the smooth 
muscle of the cutaneous vascular net- 
work, especially in the venules, 


show an extraordinarily prominent vaso- 
motor activity in contrast to other tissues, 
such as mesenteric structures, the wall of 
the intestinal tract, and skeletal muscle, 
where venous vasomotion is minimal and 
seemingly passive (Zweifach, 1959, p. 69). 


Furthermore, the venular tributaries 
in the skin are ten- to twenty-fold more 
sensitive to a fall in temperature of 
1-29 C than other tissues (Zweifach, 
1959). While the Vascular Theory 
is usually thought of as involving only 
the arterioles the responsiveness of 
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venular smooth muscle and that of the 
hair erector muscles should not be 
overlooked. 

The Vascular Theory accounts for 
the requirements of a thermal theory 
as outlined at the beginning of this 
section. It does not depend upon spa- 
tial thermal gradients to achieve stim- 
ulation and is uniquely suited for ex- 
planations involving temporal thermal 
gradients. Reference was made earlier 
in the paper to the fact that in order to 
stimulate nerve terminals located 
among the cells of the epidermis and 
hair follicles, minimal rates of tissue 
movement were necessary. On the 
assumption that nerve terminals are 
essentially alike, the same conditions 
of stimulation should apply to the 
nerve terminals ending in relation to 
the fibers of the smooth muscle of the 
cutaneous vascular system. The shift- 
ing psychological zero with no attend- 
ant thermal sensation is accounted for 
in terms of the constriction or dilation 
of the cutaneous vessels occurring 
slowly enough to be below the thresh- 
old rate of movement necessary to 
excite the nerve terminals in the ves- 
sel walls. Complete thermal adapta- 
tion represents the adjustment of vas- 
cular relaxation or constriction from 
one level to another; and the range 
through which Hensel ( 1950) has re- 
ported that complete thermal adapta- 
tion is possible, represents the range 
through which the smooth muscle of 
the cutaneous vessels can adjust and 
adopt a relatively stable end point. 
Outside of this range vascular smooth 
muscle would be unable to achieve a 
relatively stable adjustment to the 
temperature. 

Changes in the size of the thermal 
threshold as a function of the skin 
temperature should be expected since 
the latter would govern the degree of 
relaxation or constriction of the 
smooth muscle. Predictions of the 


direction in which thé thresholds 
would go are difficult to make because 
of ignorance of the mechanical and 
physiological characteristics of afferent 
innervation of smooth muscle. 

In addition to fitting these require- 
ments, the theory also handles facts 
which have been somewhat perplexing 
to other theories. Thermal sensations 
occur under some circumstances which 
do not include a change in the environ- 
mental temperature. It has been 
noted, especially in the earlier work 
on the thermal sense, that mechanical 
stimulation of the skin may produce 
sensations of warm or cool (Nafe, 
1934). Smooth muscle is known, 
upon occasion, to relax or contract 
when prodded, depending upon its 
State of contraction at the moment 
(Evans, 1956). Thermal sensations 
resulting from mechanical stimulation 
can be explained in this manner if one 
assumes that they are mediated by 
smooth muscle of the cutaneous vas- 
cular system. Warm sensations ac- 
companying a blush occur as a result 
of chemically mediated vasodilation 
since there is apparently little if any 
reflex vasomotor control in the “blush 
area” (Burton, 1959). The chill ac- 
companying scratching a fingernail 
across a blackboard is the result of a 
reflexly aroused general vasoconstric- 
tion, 

It is not necessary to postulate dif- 
ferent types of free nerve endings, each 
responsive to one type of energy; nor 
is it necessary to conceive of two dif- 
ferent systems, one for warm and one 
for cool. Two possible explanations 
exist. One depends upon neural pat- 
terns while the other depends upon 
different fibers being stimulated, based 
upon the way in which they terminate 
among the smooth muscle elements. 
In the first instance neural patterns 
aroused by constriction could con- 
ceivably differ from those aroused by 
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relaxation. Patterns in other systems 
result in different sensations. For ex- 
ample, the peristaltic waves in the 
stomach passing from the cardiac to 
the pyloric valves are associated with 
hunger pangs, while those same con- 
tractions passing in the opposite direc- 
tion result in the entirely different 
sensation of nausea. The direction of 
travel is simply a matter of timing. 
In addition to central factors involved 
in thirst, the patterns aroused in the 
membranes of the throat when they are 
dry, in comparison to when they are 
wet, contribute markedly to this sensa- 
tion. The urge to micturate and to 
defecate are likewise results of pattern- 
ing of neural discharges. 

On the other hand, Hensel and Zot- 
terman (1951c), using electrophysio- 
logical methods on the cat's tongue, 
have described two apparently differ- 
ent sets of nerve fibers, one respond- 
ing to warm and the other to cool 
stimuli. In striated muscle, such sys- 
tems occur; one responding when the 
muscle is contracted and the other 
when it is stretched. In the first in- 
stance the receptor elements are con- 
nected in series with the muscle fibers; 
whereas, in the latter case they are 
connected in parallel with the muscle 
fiber. Whether the same type of end- 
ings exist in smooth muscle has not 
been investigated, although Fischer 
(1944) has described some afferent 
terminals in smooth muscle as “spray- 
like,” and others as “spindle-like.” 

Hensel and Zotterman (1951b) 
have described fibers responding to 
thermal stimulation as smaller (1.5-5 
micra) than those responding to tactile 
(8 micra and larger). 
(1954) have 
fibers inner- 
d vessels are 
In addition, 
the smallest 
for the most 


stimulation 
Weddell and Pallie 
shown that many of the 
vating the cutaneous bloo 
of dorsal root origin. 

these fibers are among 
found in the skin, being, 


part, less than 3 micra in diameter. 
The fibers ending in cutaneous vessels, 
then, correspond with the fiber spec- 
trum which Hensel and Zotterman 
have described as responsible for ther- 
mal sensitivity. 

One of the most frequently cited 
reasons for considering warm and cool 
to be mediated by different receptor 
mechanisms is the long reaction time 
of an observer to warm compared with 
cool stimuli. Bazett et al. (1932) and 
Hensel and Zotterman (1951a) have 
used this difference in latency to cal- 
culate the depth of the receptors. 
These calculations are based on the 
assumption that the latency of response 
of the receptors are the same but occur 
at different depths. An alternate ex- 
planation, in terms of the Vascular 
Theory, is equally possible. Constric- 
tion is an active process requiring little 
time to accomplish. On the other 
hand, dilation is a passive response and 
requires a much longer time to reach 
a threshold rate of movement. 

Paradoxical cold provides little diffi- 
culty for the Vascular Theory. Para- 
doxical cold is a term ascribed to the 
situation in which a cold sensitive area 
is stimulated by means of a high tem- 
perature and produces a cold sensation. 
At high temperatures smooth muscle 
has constricting elements within a gen- 
erally relaxing system (Evans, 1956). 
It might be expected that an area 
normally responding to a cool stimulus 
would also produce a cool sensation 
when stimulated by a high tempera- 
ture. It is interesting to note that 
Zotterman (1953) reports fibers which 
normally respond to cooling, after a 
period of no response to temperature 
above 35° C. suddenly start to respond 
when stimulated by temperatures of 
45° C. and higher. 

Sensations of heat are the result of 
a mixed sensation of warm and para- 
doxical cold according to Alrutz 
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(1900). It is known (Evans, 1956) 
that when smooth muscle is warmed 
to above approximately 45? C. ele- 
ments of the muscle begin to constrict 
while the main body of the muscle 
continues to relax. According to the 
Vascular Theory hot sensations result 
from the neural activity brought about 
by the constricting elements in the 
generally relaxing smooth muscle of 
the cutaneous arterioles. 

The question arises as to how the 
sensorium distinguishes between tissue 
movements induced by mechanical 
stimuli and those induced by thermal 
stimulation. Evidence has already 
been presented to indicate that there 
is strong likelihood that different af- 
ferent nerve fibers respond to mechan- 
ical stimulation than to thermal. Ter- 
minals ending in the cutaneous 
vascular system, because of their 
smaller size and probably also because 
of the mechanical properties of the 
. tissue of blood vessels, do not respond 
as readily to mechanical stimuli, as 
those which terminate among the cells 
of the epidermis, dermis, and in rela- 
tion to hair follicles. There is no con- 
tention that nerve or its terminals may 
not be stimulated directly by a tem- 
perature change, but it has never been 
demonstrated that their temperature 
sensitivity is sufficient to account for 
thermal sensations in the skin. The 
coding system is based on different 
fibers being activated by different 
types of energy as the result of the 
mechanical and thermal properties of 
the tissue in which their terminals end. 
Further differentiation is accomplished 
in the spinal cord. Descriptions of 
the neuroanatomy of the spinal cord 
(Ranson & Clark, 1959) maintain that 
fibers serving touch travel in the fas- 
ciculi gracilis and cuneatus as well as 
in the ventral spinothalamic tract, 
while those allied with thermal sensa- 
tion and pain ascend the cord pre- 
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dominantly in the lateral spinothalamic 
tract, although there is probably con- 
siderable intermingling of fibers among 
the various tracts (Rose & Mont- 
castle, 1959). Fibers from these tracts 
terminate in different brain stem nu- 
clei but their connections are beyond 
the scope of this paper. 

Aside from the fact that the theory 
can account for a large number of the 
facts concerning tactile and thermal 
sensitivity it has the added advantages 
of simplicity and parsimony. It can 
serve as a convenient model within 
which to organize additional facts as 
well as bringing into sharp relief gaps 
in present knowledge. Furthermore, 
such a model allows predictions and 


Suggests direct experiments which 
should be done. 
SUMMARY 


Qualitative theories requiring spe- 
cialized encapsulated end organs or 
specialized peripheral nerve fibers to 
account for the various qualities (pres- 
sure, warm, and cold) of cutaneous 
sensation fail to present direct evidence 
that such specialization exists in 
peripheral nerves or in their termina- 
tions. The Quantitative Theory, dis- 
cussed here, maintains that qualities of 
cutaneous sensation are, in part, a 
function of the mechanical and thermal 
properties of the tissue in which the 
cutaneous afferent nerves terminate 
and variations in the temperal and 
spatial patterns of the afferent nerve 
discharge. The latter variations are 
manifest in the frequency and duration 
of the impulses, and the area over 
Which they arise as well as the relative 
number of fibers actuated within the 
area. The Quantitative Theory con- 
Sists of four postulates. Y 

l. Nerves which end freely, with- 
Out encapsulation of any sort, are the 
primary neurological elements present 
in hairy skin capable of mediating 
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sensations derived from normal stim- 
ulation of the skin. Nerves ending 
freely in the skin terminate in relation 
to different tissues. 

a. Some fibers branch and termi- 
nate among the dermal and epidermal 
cells of the stratum granulosum. 

b. Others, and probably some 
branches of those terminating as in 
(a) form a complex net about the hair 
follicles. 

c. Smaller fibers branch and ter- 
minate among the smooth muscle ele- 
ments in the walls of the cutaneous 
vascular system and perhaps the hair 
erector muscles. 

2. The terminals of nerve fibers 
supplying the skin are functionally 
similar, differing primarily by the tis- 
sue in which they terminate. Being so, 
they share a common adequate stim- 
ulus which is movement of the ter- 
minals either upon themselves or in 
relation to their immediately surround- 
ing tissue. 

3. Smooth muscle responds to ther- 
mal changes, in general, relaxing when 
warmed but constricting when cooled. 

4. The thermal and mechanical prop- 
erties of the tissue in which the ter- 
minals end determine which fibers will 
respond to mechanical stimulation and 
which will be thermally sensitive. Dis- 
tinction between thermal and tactile 
sensations depends upon the central 
connections which the fibers make. 
Qualitative differences in sensations 
derived from various mechanical stim- 
uli are the result of differing patterns 
of neural excitation. Thus stroking a 
surface results in different 
tterns in the tissue and 
tterns of action po- 
tentials from stroking a rough surface. 
Distinction of warm and cool sensa- 
tions results from either different pat- 
terns of action potentials caused by 
vasoconstriction as compared to on 
dilation, or different fiber termina's 


smooth 
movement pa 
hence different pa 


responding to vasoconstriction than to 
vasodilation. 
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This paper reports an attempt to 
apply a simple association learning 
model to the analysis of the influence 
of response and training variables on 
paired-associate learning. The issues 
under investigation are old ones but 
have not been resolved satisfactorily 
in the past. With the aid of the ele- 
mentary learning model, the problems 
are posed clearly and, the data willing, 
adequately resolved by the use of a few 
simple and intuitively compelling as- 
sumptions about learning. 

The first problem that led to this 
investigation concerns the relationship 
betwen the number of response alter- 
natives (N) and error rate in paired- 
associate learning. Experimental re- 
sults (Noble, 1955; Riley, 1952) are 
in agreement in showing that the num- 
ber of errors subjects make before 
reaching some criterion of learning is 
greater the larger the number of re- 
sponse alternatives. There is little 
agreement, however, as to the inter- 
pretation of this fact. 

Two possible factors could be in- 
volved. First, the effectiveness of a 
reinforced trial in increasing perform- 
ance (ie. the learning rate constant) 
may be influenced by N; and second, 
N may influence the probability of be- 
ing correct by sheer guessing on items 
that are yet unlearned. It is a reason- 
ably safe assumption that N has the 

second effect on chance guessing. 
Previous data are unclear on whether 
N also influences the first factor, the 
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effectiveness of a reinforced trial. This 
is not an easy question to answer since 
it is difficult to separate the effects of 
these two factors in the data. Guess- 
ing occurs only on unlearned items but 
there is no way to tell by direct ob- 
Servation just how many items have 
been learned and how many have been 
guessed correctly on any given trial. 
Moreover, there appears to be little 
hope that more refined or ingenious 
experimental procedures will enable 
us to unconfound these two factors so 
that we may crucially test the hypothe- 
sis that N affects the learning rate. 

This is the type of situation in which 
a theoretical model of learning can 
make a strategic contribution, Indeed, 
without the aid of some formal model 
of the learning process, the question 
of the effect of N can neither be posed 
Clearly nor answered clearly. With 
the aid of a theory one can make 
suitable allowance for the guessing 
factor and thus make àn assessment 
of the contribution of learning and 
guessing at every stage of the ex- 
periment, 

The model to be presented has been 
formally treated in more detail in a 
previous paper (Bower, 1961). Here 
the theory will be presented informally 
and only those implications relevant 
to the present discussion will be intro- 
duced. For a more formal statement 
of the axioms and theorems, the reader 
may consult the prior report. The 
basic notion of the model is the as- 
Sumption that each stimulus item. and 
its correct response become associated 
on an all-or-none basis. Considering 
a single item, it can be in either of 
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two states at the beginning of each 
trial: conditioned to its correct re- 
sponse or not conditioned. If the item 
is conditioned at the beginning of a 
trial, then the correct response occurs. 
If the item is not conditioned, then 
the probability of the correct response 
depends somewhat upon the experi- 
mental procedure. In experiments by 
the writer, the subjects were told the 
N responses (Integers 1, 2, . - - N) 
available to them and were told to 
respond on every trial regardless of 
whether they knew the correct answer. 
If the N numbers occur equally often 
as the to-be-learned responses to the 
items, then the probability that the 
subject will guess correctly on an un- 
learned item is 1/N on the simplest as- 
sumptions ; ° correspondingly, his prob- 
ability of guessing incorrectly is 


2The arguments to be developed apply 
only to those conditions in which the re- 
sponse alternatives are immediately avail- 
able to the subject and he is permitted time 
to give a relevant response on each trial 
before the terminal event (e.g, reinforce- 
ment) occurs. This is the procedure most 
familiar to those who run animal subjects 
in choice situations. A more frequent pro- 
cedure in human learning experiments has 
been to give the subject a fixed time inter- 
val in which he may respond, and the ter- 
minal event (e.g. correct response in paired 
associates, next item in a serial list) follows 
after that time interval regardless of whether 
the subject makes a relevant response. To 
remark on the sociology of experimenters, 
there seems to be an empirical correlation 
between the use of experimenter vs. sub- 
ject-controlled exposure time and whether 
the relevant responses per se (as well as the 
S-R associations) must be learned in the 
experimental situation: when the responses 
must be learned, experimenters usually con- 
trol exposure time ; when the response al- 
ternatives are immediately available, either 
via instruction or construction, the subject is 
allowed to control exposure time. Clearly, 
however, there is no necessary entailment 
between these procedural aspects, and one 
may expect the empty cells of the 2x2 
contingency table to be filled by future ex- 


perimentation. 
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1— 1/N. The following discussion of 
the model is oriented specifically to- 
wards such an experimental procedure. 

With the theory formulated in this 
way, one cannot uniquely specify the 
subject's state of conditioning on a 
given item from the knowledge that 
he made a correct response, since this 
correct response may have come about 
by guessing on an unlearned item. 
Thus, the theory does not imply that 
the first correct response will be fol- 
lowed by correct responses on all sub- 
sequent trials. However, if the sub- 
ject makes an error, then we can make 
a determinate inference that the item 
was not conditioned at the beginning 
of the trial. 

We have introduced the notions of 
the two states of conditioning and the 
probability of the correct response 
given the item’s state of conditioning 
at the beginning of each trial. We 
assume that each item in initially un- 
conditioned and that the effect of suc- 
cessive reinforced trials is to provide 
repeated opportunities for the item to 
become conditioned. The single 
parameter of the theory is the learn- 
ing rate constant (c) which represents 
the probability that an unconditioned 
item becomes conditioned as the result 
of a single reinforced trial. The prob- 
ability that a reinforced trial fails to 
condition the correct response to an 
unlearned item is 1 — c. The prob- 
ability that an item is still not condi- 
tioned after n reinforced trials is 
(1—c)*. If c is larger than zero, 
then this probability approaches zero 
as n becomes large; that is, given a 
large number of reinforced trials, it 
it certain that the item will become 
conditioned on some one of these re- 
inforced trials. When the item be- 
comes conditioned, the probability of 
a correct response jumps from 1/N 
up to 1. 

From the considerations above we 
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may obtain an expression for q,, the 
probability that the subject responds 
incorrectly to a given item on Trial n 
of the experiment. To obtain q, we 
note that the probability that the item 
has failed to be conditioned during the 
preceding n—1 trials is (1 — c)"; 
and if the item is not conditioned then 
the probability of guessing incorrectly 
is 1 — 1/N. Hence, the probability 
of an error on Trial » is given by the 
product of these two factors: 


p= (0 -1/N)—4** n] 


Using this elementary association 
model, the first question of this inves- 
tigation can be restated in a clear man- 
ner: Is c independent of N? If c is 
independent of N, then, given an esti- 
mate of c obtained from one group, we 
should be able to predict in advance 
the performance of other groups 
trained with differing numbers of re- 
sponse alternatives by simply adjust- 
ing the N factor in Equation 1. To the 
extent that these free predictions are 
accurate, we would have good evidence 
for the assertion that N affects guess- 
ing probabilities but has no influence 
on learning rate per se, 

The second question prompting this 
investigation concerns the relation be- 
tween several different training condi- 
tions and performance in paired-asso- 
ciate learning. The two major condi- 
tions of training studied to date are 
the correction and noncorrection con- 
ditions. The more frequently used 
condition is the correction procedure 
in which subject is informed of the 
correct response on every trial regard- 
less of whether he responds correctly. 
In noncorrection training, the subject 

is told right or wrong depending on 
whether his response is correct or in- 
correct. In the following, we will use 
the word “reinforcement” in the gen- 
eral sense intended by Estes (1960), 
viz., as operations exerting certain 


general quantitative effects upon re- 
sponse probabilities. In paired-asso- 
ciate experiments, the reinforcing op- 
eration is that of informing subjects 
of the correct response or in some 
way ensuring that this response is the 
last one to occur in the presence of a 
stimulus item. With this interpreta- 
tion, then, on every trial under a cor- 
rection procedure the correct response 
is reinforced. The noncorrection 
training procedure differs in that the 
correct response is reinforced only on 
those trials when it Occurs; since all 
incorrect Tesponses are followed by 
the — experimenter simply saying 
"Wrong," no explicit reinforcement 
operation is involved. We will refer 
to such trials as nonreinforced trials, 
Within the model, an account of 
the noncorrection procedure takes the 
form of specific assumptions about the 
formation of associations following re- 
inforced and nonreinforced trials. 
Considerations of parsimony lead us 
to assume that the effect on condition- 
ing of a reinforced trial is the same 
whether the reinforcement occurs in 
the context of a Correction or noncor- 
rection procedure, viz., the probability 
that an unlearned item becomes condi- 
tioned to its reinforced response is c, 
the same constant as in the case of 
the correction procedure. A new as- 
sumption that is required concerns the 
effect of a nonreinforced trial, when 
the experimenter says "Wrong" fol- 
lowing an error, The assumption that 
has been made elsewhere (Thorndike, 
1932) and that is made here also, is 
that a nonreinforced trial in this situa- 
tion results in no net change in the 
Probability of a correct response. This 
assumption is not offered as a uni- 
Versal interpretation of nonreinforce- 
ment; however, within the specific sit- 
uation to which the model will be 
applied (ie. the alternative responses 
are equiprobable initially and a num- 
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ber of different S-R events occur be- 
tween successive presentations of any 
given stimulus) there is some basis for 
thinking that the no change assumption 
may be approximately correct. 

From these assumptions about the 
noncorrection procedure, a difference 
equation can be derived expressing 
the change in the average probability of 
an error from Trial n to Trial n + i 


Qni = LACE 
+O ai e)a] [2] 


The first term on the right-hand side 
of Equation 2 exhibits the assumption 
of no change (i.e. qui = qn) when 
the subject makes an error on Trial n, 
with probability qn; the second term 
shows that if a success occurs (with 
probability 1 — qn), then with prob- 
ability c the item becomes conditioned 
(if it is not already) and with prob- 
ability 1 — c the reinforcement was in- 
effective and no change occurs on that 
particular trial. An explicit solution 
of Equation 2 is not available. How- 
ever, Miller and McGill (1952; see 
also Bush & Mosteller, 1955, p. 181), 
have derived the following recurrence 
relation for Equation 2 (the constants 
have been changed appropriately) : 


qa=(1—1/N) U- (1 — c)rt]asa 
prp-4jmu-de BI 


Equation 3 can be used to compute 
successive values of the average error 
probability once N and c are known. 
In general, the predicted learning 
curve is S shaped, a feature which is 
consistent with the expectation of a 
low rate of improvement early in train- 
ing when the subject is trying to dis- 
cover the correct response. The point 
of inflection of the S curve will be 
positively related to N. These fea- 


tures seem in qualitative agreement 


with noncorrection results reported by 


Noble (1955), although his experi- 
mental procedure was more complex 
than the one under present considera- 
tion. 

At this point in the analysis, Experi- 
ment I was carried out with four 
groups of subjects learning the same 
list of 10 paired associates. The vari- 
ables were the number of response 
alternatives. (the first three or first 
eight integers) and correction vs. non- 
correction training procedure. For 
brevity, the four groups will be re- 
ferred to by the symbols 3-C, 8-C, 
3-NC and 8-NC, where 3 or 8 repre- 
sent the number of responses, and C 
and NC designate correction and non- 
correction training, respectively. 
more extensive discussion of the pro- 
cedure and results of Experiment I 
will be deferred till later; it will be 
sufficient here to note one critical result 
of Experiment I since it was this fact 
that led to Experiment II. The fact 
was that strong evidence was obtained 
to support the assumption of no change 
in success probability following a non- 
reinforced trial in the noncorrection 
training groups; that is, with either 
three or eight response alternatives, 
saying "Wrong" followed a subject's 
incorrect response had no effect on 
his probability of success on the next 
trial. 

This no change result supports our 
earlier assumption about nonreinforced 
trials, but how are we to understand 
it? On rational grounds, one might 
expect that subjects would tend to 
eliminate a response which was fol- 
lowed by "Wrong." For example, if 
subjects in the 3-NC group tended to 
eliminate their first erroneous Te- 
sponse, then the probability of the 
correct response on the next trial 
should be around one-half or at least 
greater than one-third. However, the 
results of Experiment I clearly showed 
that this increase did not occur, an 


Lj 
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we seek some explanation for why it 
did not occur. 
An explanation may be found per- 
haps by attending to the responses 
evoked in the subject by saying to him 
“Wrong” after he has responded, eg. 
"three." The reason for this inquiry 
is that, holding to a strict contiguity 
interpretation for the formation of as- 
sociation, it is these terminal reactions 
to the stimulus which have an oppor- 
tunity to become conditioned. Often 
these implicit reactions to “Wrong” 
are primarily emotional; in this re- 
gard, it may be reported that subjects 
frequently volunteered the information, 
"I know I've been getting that item 
wrong, but I can't remember what 
number I said last time." Clearly, if 
the subjects' implicit reactions are pri- 
marily emotional, then we may expect 
no change in the recorded response 
probabilities on the next trial. How- 
ever, if subjects implicitly react to 
"three is wrong" with the response 
"one or two is correct," when the con- 
tiguity interpretation would imply that 
the probability of response "three" 
would decrease and the probability of 
responses "one" and/or “two” would 
correspondingly increase, 
According to this interpretation, the 
noncorrection procedures in Experi- 
ment I did not insure responses of the 
form, e.g, "one or two is correct" 
after the experimenter said "Wrong" 
to "three." For one of the groups in 
Experiment II, conditions were ar- 
ranged to insure the occurrence of re- 
sponses in this form following errors. 
There were three response alterna- 
tives and subjects were instructed that 
one and only one number was correct 
for each nonsense syllable. If, for 
example, the subject responded with 
"three," instead of saying "Wrong," 
the experimenter said "one or two is 
correct." It should be noted that, in 
a formal sense, the subject gains no 


more information about the correct re- 
sponse with this procedure than when 
the experimenter says "Wrong," as in 
the noncorrection procedure. Com- 
parisons of the results of this proce- 
dure with the standard noncorrection 
procedure are given below. 

The second condition run in Experi- 
ment IT was aimed at a slightly differ- 
ent question. The question was 
whether there is some more basic way 
of specifying reinforcement contingen- 
cies in this situation rather than merely 
listing them, correction and noncorrec- 
tion. The formulation proposed here 
is that this basic variable is the de- 
gree to which the experimenter speci- 
fies the correct response following an 
error by the subject; to describe it in 
another way, the variable is the size 
of the subset of alternatives within 
which the one correct alternative is 
said to lie. The correction and non- 
correction procedures occupy the ex- 
treme poles of this dimension; in the 
correction procedure, the experimenter 
uniquely specifies the correct response ; 
in the noncorrection procedure, when 
the experimenter says “Wrong,” he 
implicitly specifies “one of the other 
N — 1 alternatives is correct." 3 To 
illustrate the construction of inter- 
mediate values of this training dimen- 
sion, suppose there were eight response 
alternatives and that the subject re- 
sponds incorrectly with “one” to a 
given item; then the experimenter 
might say "three or six is correct," or 
"six or eight or two or five is correct," 
and so on. For the second group in 
Experiment II, with eight responses, 
the experimenter said two numbers 
following errors. 'The subjects were 
instructed that one of the numbers was 
correct and that the other number was 
a distractor, but there were no cues 


3 However, see the related discussion 
above concerning the subject’s reactions to 
“Wrong.” 
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as to which number was the correct 
one. Of course, if the subject gave 
the correct number, the experimenter 
indicated this to him. 

It is clear that such a continuum of 
training conditions can be constructed. 
It is also clear that variations in this 
training variable should produce 
graded variations in performance in- 
termediate between the two extremes 
produced by the correction and non- 
correction procedures. To describe 
some of the factors involved in this 
prediction, consider the reasons for ex- 
pecting the partial correction group 
listed above (call it the 8-P group) 
to make fewer errors than the 8-NC 
group from Experiment I. 

First, a subject in the 8-P condition 
is expected to make fewer errors be- 
fore his first correct response because 
there is at least some likelihood that 
the correct association will be formed 
following an error. Specifically, we 
assume that before the first correct re- 
sponse the probability that the item 
becomes conditioned following an er- 
ror is c/2, where c is the same learn- 
ing constant as before. In general, if 
the subject is told k alternatives, one of 
which is correct, then we assume that 
the probability of the correct associa- 
tion being formed is c/k on each trial 
before the first success. This formu- 
lation is equivalent to assuming that 
the subject selects at random one of 
the k possibly correct responses to 
rehearse on a given trial, with prob- 
ability 1/k he selects the correct re- 
sponse, and with probability c re- 
hearsal of this correct response results 


in conditioning. 

The preceding analysis applies on 
trials before the first correct response 
occurs. After the subject has been 
once informed of the correct response, 
his probability of recognizing it among 
the two numbers the experimenter says 
(following a subsequent error, if any) 


will be greater than one-half. If the 
subject recognizes the correct number 
and then rehearses it, the effect is 
much the same as a reinforced trial. 
In general, we may let r represent this 
recognition probability following the 
first correct response; because of im- 
plicit rehearsal, 7 is also the prob- 
ability of a reinforced trial on incorrect 
trials following the first correct re- 
sponse. Specifically, we assume for 
the present case that r is unity; that 
is, following the first correct response, 
we assume that subject can recognize 
with Probability 1 the correct number 
among the two numbers that the ex- 
perimenter says following an error. 
Tf the subject recognizes the correct 
response and rehearses it, then with 
probability ¢ the conditioned associa- 
tion is formed. 

To summarize the discussion for 
the partial correction groups, we have 
assumed that prior to the occurrence 
of the first correct response the prob- 
ability of conditioning is c/2 on each 
trial; on the trial of the first correct 
response and those trials following the 
probability of conditioning is c. In 
a later section we suggest a number of 
ways to test the details of these as- 
sumptions. 

In the experiments to be described 
there were six independent groups; if 
the model and theoretical assumptions 
are correct, then the data from the six 
groups can be reproduced after esti- 
mating the constant ¢ from one of the 
groups selected arbitrarily. One can 
easily recognize the advantages of cast- 
ing our theoretical assumptions in ex- 
plicit form within such a model. Not 
only can we clearly pose the questions 
of how N and its interactions with 
training conditions affect error rate, 
but we have also developed a concep- 
tual framework within which it is pos- 
sible to get an answer to these ques- 
tions. The basic learning parameter, 
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c, plays a central role in the theory 
and is not just a curve-fitting constant; 
for a homogeneous population of sub- 
jects with the same learning materials, 
the underlying theory constrains c to 
be the same for all conditions. If in- 
deed such parameter invariance ob- 
tains (i.e., if c can be transposed from 
one condition to the others), then a 
strong appeal can be made for the 
simplicity and power of the underlying 
theory within which c derives its mean- 
ing. 
THE EXPERIMENTS * 


The subjects, 88 Stanford under- 
graduates, were required to learn a 
list of 10 paired associates to a cri- 
terion of two successive correct runs 
through the entire list. The stimulus 
items were nonsense syllables of 0-5 % 
association value chosen for low intra- 
list similarity. The Tesponses were 
the first three or first eight integers, 
paired randomly with the stimuli for 
each subject. With three response 
alternatives, two of the three responses 
were used for three stimuli each and 
the third response, selected randomly 
for each subject, was paired with the 
remaining four stimulus items. With 
eight response alternatives, all eight 
numbers were first paired with eight 
randomly selected syllables for each 
subject, and then the remaining two 
syllables were assigned to different 
but randomly selected numbers for 
each subject. 

The subjects, run individually, were 
instructed to learn a list of 10 non- 
sense syllables and their associated 
number responses, the response al- 
ternatives being either one, two, or 
three (or one, two, . . ., seven, or 
eight), and that the syllables and num- 
hers had been paired off randomly. 
"They were asked to respond within 


*' This experiment was carried out with 
the assistance of Takao Umemoto. 


2 or 3 seconds after the stimulus card 
was shown, and to guess a number if 
they didn't know the answer. The 
result of this practice is that the ex- 
posure time cannot be specified ex- 
actly; however, it was noted that the 
subjects followed instructions and re- 
sponded within 2 or 3 seconds in the 
large majority of cases. 

For all three training conditions, if 
the subject gave the correct number to 
à card, the experimenter repeated it, 
eg., "three is correct The three 
conditions differed on trials when the 
subject responded incorrectly: in the 
correction conditions, the experimenter 
said the correct number; in the non- 
correction conditions, the experimenter 
said "Wrong" following errors; and 
in the partial correction condition, the 
experimenter said two numbers fol- 
lowing errors, €g., "one or three is 
Correct." The subjects in the partial 
conditions were informed that only one 
of the numbers the experimenter said 
would be correct and the other number 
Was a distractor (chosen at random) 
and that half the time the Correct num- 
ber would be said first and half the 
time it would be said second. The 10 
stimulus cards were shuffled for 15 
seconds between runs through the 
deck. The entire list was repeated 
until the subject went through two 
consecutive cycles without any errors. 

The numbers of subjects in each 
group were 14 in the 3-C, 8-C, and 
8-NC groups; 15 in the 3-P and 8-P 
groups; and 16 in the 3-NC group. 
Assuming that each item and each 
Subject may be characterized by the 
Same value of c the data consist of 
140, 150, or 160 sequences of correct 
and incorrect responses for the vari- 
ous experimental conditions. Before 
the model could be evaluated, the 
learning rate constant, c, had to be 
estimated. The data from the 3-C 
group were selected for this purpose; 
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the least squares estimate of c = .218 
was obtained by fitting the learning 
curve (ie, percentage correct vs. 
trials) for this group. 

The first predictions concern the 
average total errors per item before 
achieving the learning criterion. The 
predictions for the correction and non- 
correction groups were obtained by 
summing Equations 1 and 3, respec- 
tively, over trials. The predictions for 
the partial groups were obtained by 
calculating the average errors before 
the first success (using c/2 as the rate 
constant) and the average errors fol- 
lowing the first success (using c as the 
rate constant) and then adding these 
two numbers. The observed means 
(M) and standard errors of the means 
(oy) are shown in Table 1 along with 
the predicted means (P). 

It is clear from Table 1 that the 
theory performs adequately in predict- 
ing average total errors using the 
single estimate of c. In all six cases 
the predicted value falls within one 
standard error of the observed mean.? 
The differential effect of N upon error 
rate is amplified as one proceeds from 
the correction through the partial to 
the noncorrection training procedures. 

A second indication of the goodness 
of fit of the theory is shown graphically 
in Figures 1 and 2 which present ob- 
served and predicted values for cumu- 
lative errors as a function of trials. 
The smooth curves connect the pre- 
dicted values while the unconnected 
points are the empirical values. The 
predicted values for the correction and 
noncorrection conditions were obtained 


5'The standard errors for the 3-C and 
8-C groups can be predicted by the theory 
and turn out to be somewhat larger than 


the observed values (predicted values were 


24 and .30 for 3-C and 8-C groups, respec- 
ble variance pre- 


tively). Deriving compara 1 
dictions for the partial and noncorrection 
conditions presents complex mathematical 


problems which have not been solved to date. 


TABLE 1 
AVERAGE TorAL NUMBER Or Errors PER 
ITEM OBSERVED AND PREDICTED 


Response alternatives 


Training 
condition 3 8 
M = 2.94 M = 3.95 
Correction P = 3.03 P =4,00 
ou = .19 ex = .25 
M = 3.74 M = 5.95 
Partial P -— 3.88 P = 5.80 
ou = .21 ex = .35 
M = 5.65 M = 9.42 
Noncorrection P = 5.59 P = 9.54 
ou = .34 ov = .61 


by cumulating successive values of 
error probabilities calculated from 
Equations 1 and 3, respectively. The 
theoretical curve for the partial cor- 
rection conditions is not expressible 
in a simple equation but successive 
values can be obtained. The first sev- 
eral values will be derived here to il- 
lustrate the procedure. Define Q, as 
the probability that the item is not yet 
conditioned by the beginning of Trial 
n. The probability of an error on 
Trial n would then be (1— 1/N)O,. 
The first several values of Q, for 
the partial correction conditions are: 


by assumption 


Q=1 
a= 4tu-9+(1-%) 


x (1 - i) 
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Fic. 1. Average cumulative errors per item plotted as a function of trials for subjects 
having three response alternatives. (The smooth curves represent predicted values; the 
dots near a curve represent the corresponding observed values.) i 
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Fic. 2. Average cumulative errors per item plotted against trials for subjects having 
eight response alternatives. (The smooth curves represent predicted values; the dots near 
a curve are the corresponding observed values.) 
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The value of Q, is the sum of two 
probabilities: (a) that a correct guess 
occurs on Trial 1 but conditioning 
failed to occur with probability 1 — c, 
and (b) that subject guessed incor- 
rectly on Trial 1 and fails to condition 
the correct response with probability 
1—c/2. The value of Q, is the sum 
of the joint probabilities of three 
events: (a) correct on Trial 1, fails to 
condition on Trials 1 and 2; (b) error 
on Trial 1, success on Trial 2, fails to 
condition after either trial; (c) errors 
on both trials, not conditioned on 
either trial. The general rule for ob- 
taining Qn is that prior to the first 
correct response the conditioning prob- 
ability is c/2 and that on the trial of 
the first correct response and there- 
after the conditioning probability is c. 
By multiplying these On values by 
1 — 1/N and summing them, the pre- 
dicted curves for the partial correction 
conditions are obtained. As Figures 
1 and 2 show, for five of the six 
groups the fit of predicted to observed 
values is satisfactory; the correspond- 
ence to the 8-NC points would not be 
impressive were it not for the fact 
that the same c constant was used in 
generating all the curves under appro- 
priate boundary conditions. 
Equations 2 and 3 for the noncor- 
rection conditions were derived with 
the assumption that no change resulted 
from nonreinforced trials. To the 
extent that these equations 
describe performance in the noncor- 
rection conditions, the assumption of 
no change is supported. However, 
additional data bearing upon this as- 
sumption may be obtained by analysis 
of the early trials before learning has 
occurred. Beginning with the first 
trial, consider the sequence of proba- 
bilities of a correct response follow- 
lug: Oy Jy 25 aee consecutive errors. 
As soon as a subject responds cor- 
rectly to a given item, that item is 


adequately, 


TABLE 2 


PROBABILITY OF A SuccEss FOLLOWING 7 
CONSECUTIVE Errors FROM THE FIRST 
TRIAL FOR THE NONCORRECTION 


Groups 
lad 3-NC ' &NC 
0 .34 12 
1 31 11 
2 .28 14 
3 31 17 
4 31 13 
5 37 16 


dropped from later computations. If 
the randomization of responses to 
stimuli has been effective, then the first 
value of this series should be close to 
1/N, the probability of guessing the 
correct number by chance. Moreover, 
if the assumption of no change is cor- 
rect, then the probabilities of a success 
following 1, 2, 3,... consecutive 
errors should remain around 1/N for 
the noncorrection groups, deviating 
from this value only by random sam- 
pling fluctuations. The results of this 
analysis are shown in Table 2 for the 
noncorrection groups. It should be 
realized that going down the column 
these estimates become progressively 
less reliable because the number of 
cases involved (beginning with 160 or 
140) decreases. These series of esti- 
mates for both three and eight response 
alternatives reveal no significant trends 
away from the first value. Hence, 
the results provide additional support 
for the assumption of no change fol- 
lowing nonreinforced trials in the non- 
correction conditions. 

An analysis related to that above 
can be given for the correction and 
partial correction groups. Because the 
success rate was higher for these 
groups, there was insufficient data to 
obtain reliable estimates of single-trial 
probabilities such as shown in Table 2 
for the noncorrection groups. How- 
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TABLE 3 


PREDICTED AND OBSERVED CONDITIONAL 
PROBABILITIES OF SUCCESS FOLLOWING 
AN ERROR ON THE PRECEDING TRIAL 


Response alternatives 
Training 
condition 3 8 
Observe| Predict| Observe| Predict 

Correction 48 48 35 35 
Partial 

Before first 

success 35 35 E .22 

After first 

success A6 .48 .38 .33 


ever, this difficulty may be circum- 
vented by considering over all trials 
the probability of a success given an 
error on the preceding trial. For the 
correction group, this probability 
should be a constant, c 4- (1— C) Po 
where po is the probability of guessing 
correctly. For the partial correction 
conditions, the probability of a suc- 
cess following an error should be c/2 
+(1—c/2)p, before the first suc- 
cess and should be c + (1 — c), after 
the first success. Such predictions are 
very sensitive to small changes in Po 
Our efforts to randomly assign re- 
sponses to stimuli were directed to- 
wards making f, be close to 1/N in 
value. However, with even a mod- 
erate-sized sample, small deviations of 
the actual probability of correct guess- 
ing inevitably occur. Hence, a fairer 
test of the model (rather than a test 
of the randomization procedure) is 
provided by predictions based on em- 
pirical estimates of po Using the 
first-trial estimates of p, (which on 
the whole were close to 1/N), the 
predicted probabilities of success fol- 
lowing an error were obtained and are 
compared with the observed values 


in Table 3. The correspondence be- 
tween predicted and observed values is 
quite satisfactory; we may note that 
the rather complex assumptions about 
conditioning in the partial correction 
conditions receive substantial support 
from these data. 

Another way to test the assumptions 
of the model involves predicting the 
number of errors before the first suc- 
cess. Since for the partial and non- 
correction groups the stochastic proc- 
ess is not complicated prior to their 
first success, it is possible to predict 
both the mean and variance of this 
statistic for these groups. Define F, 
to be the number of errors before the 
first success, and po to be the prob- 
ability of a correct guess. Then it 


can be shown (Bower, 1961) that the 
mean and variance of F, for the non- 
correction groups are: 


1— po 1 — po 
M = x Y pu [5] 


Similarly, for the correction groups, 
the mean and variance of F, are: 


MEC 
uud Tto cm EC 


V= M+ IPL — 2c) 


The expressions for the partial cor- 
Tection groups are the same as in 
Equation 6 except with ¢ replaced by 
c/2. 

The predicted values of the mean 
and standard error of F, for the six 
Sroups are shown in Table 4 along 
with the observed means and standard 
errors. For the noncorrection groups, 
P» Was estimated as the average of the 
values in Table 2 since the theory says 
those values represent random fluc- 
tuations around po. 

Inspection of Table 4 shows that in 


a 


22 
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the six cases the predicted mean F, 
is within a standard error of the ob- 
served mean, and the predictions of 
the standard errors are also close to 
the observed standard errors. In gen- 
eral, the average F, increases with N 
and the differential effect of N is 
amplified under the partial and non- 
correction training conditions. 

It should be noted that at an em- 
pirical level N has its primary influ- 
ence upon errors before the first suc- 
cess, and has practically no influence 
upon errors following the first success. 
The reader may convince himself of 
this fact by subtracting the mean Fy 
values in Table 4 from the mean total 
error values in Table 1. At first 
glance, this observation seems to in- 
validate the model by showing that N 
only influences the discovery stage but 
not the fixation stage of learning after 
the first correct response has occurred. 
In fact, however, theoretical predic- 
tions of average errors following the 
first success correspond closely to the 
observed values; of course, this must 
be so since the theory predicts F, and 
total errors reasonably well. The rea- 
son we expect errors following the 
first success to be relatively constant 
over N is that, according to the theory, 
a sampling bias is involved in com- 
paring the three and eight alternative 
conditions at this point. Errors fol- 
lowing the first success will occur only 
for items whose first correct response 
came about by guessing, and for these 
guessed correct items we expect more 
errors with eight than with three re- 
Sponse alternatives. Let gx represent 
the probability that with N alternatives 
the first correct response comes about 
by guessing and, let ex represent the 
average number of subsequent errors 
ior an item whose first correct re- 

F Then 


sponse occurred by guessing- 
the expression for the average number 


of errors following the first correct 


response (call it wy) is: 


wy = grey + (1 — gx):0 
= gnen [7] 


That is, a proportion gy of first cor- 
rect responses come about by guess- 
ing and we expect an average of ey 
more errors for these items; a propor- 
tion 1—gy of the first correct re- 
sponses come about by conditioning on 
the preceding trial and for these items 
we expect zero subsequent errors. Ac- 
cording to the theory, es will be larger 
than e; however, gs will be smaller 
than g,; that is, given a first correct 
response, it is more likely to have 
occurred by guessing when N is three 
than when N is eight. Therefore, the 
fact that the products ge, and gses are 
approximately equal is neither surpris- 
ing nor a refutation of the theory. 


DISCUSSION 


The overall results provide strong 
support for the validity of the model 
and the specific hypotheses about ex- 


TABLE 4 
OBSERVED AND PREDICTED MEAN AND 
STANDARD ERROR OF THE NUMBER 
or Errors BEFORE THE FIRST 
SuccEss 


Response alternatives 


Training a 
condition 3 8 
M = 1.33 M = 2.25 
s F = 1.39 F= 2.37 
Correction ex = 092 ex = 185 
op = 25 or = .180 
M = 2.12 M = 4.15 
. F = 2.16 F = 4.02 
Partial cu = 192 cuc .289 
op = .191 op .329 
M = 2.17 M = 6.20 
x 5 F = 2.15 F = 6.38 
Noncorrection ex m 201 ear "1 
ep = .206 or = .558 
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perimental variables. The data of six 
independent groups were adequately 
reproduced using a single estimate of 
the learning rate constant obtained 
from one of these groups. The fact 
that in 24 independent predictions the 
model came close to the data is suf- 
ficient justification for exploring fur- 
ther consequences of the theory. 

The initial problem that led to this 
investigation was whether the number 
of response alternatives could be 
shown to affect learning rate on rein- 
forced trials in addition to contributing 
to differential error probabilities due 
to guessing on unlearned items. The 
data supported the assumption that 
learning rate on reinforced trials was 
a constant independent of the number 
of response alternatives, and that the 
effect of N upon error rate could be 
attributed to differential guessing 
probabilities on unlearned items. It 
should be noted that these conclusions 
are valid only for the experimental 
procedure used here in which the re- 
sponse alternatives are immediately 
available to the subject. For alterna- 
tive procedures in which the subject is 
required to learn the responses (e.g., 
three or eight nonsense syllables) as 
well as the S-R associations, a number 
of complicating factors enter to ob- 
scure the picture and prevent resolu- 
tion of the basic problem. 

The second problem of this study 
was to account for the effect on per- 
formance of several training proce- 
dures. One may conceive of the cor- 
rection, partial correction, and non- 
correction procedures as varying in 
the degree to which the correct re- 
sponse is specified following an in- 

correct response by the subject. This 
variable in turn determines the prob- 
ability that the correct response is re- 
inforced following incorrect responses. 
Thus, in the correction condition, the 


reinforcement probability is 1 on every 
trial; under partial correction, the re- 
inforcement probability is 1/k before 
the first correct response and is es- 
sentially 1 afterwards due to a high 
recognition probability; under noncor- 
rection, the probability that the cor- 
rect response is reinforced is essen- 
tially zero following errors. The data 
supported these hypotheses and gave 
additional support to the assumption 
that when the correct response was re- 
inforced, the probability of condition- 
ing was a constant, c, the same for all 
three conditions. 

In addition to confirming these spe- 
cific assumptions about N and training 
conditions, the results lend support to 
the learning model in which these as- 
sumptions are embedded. In this one- 
element model, the probability of the 
correct response can have only two 
values, 1/N or 1. The stimulus ele- 
ment begins in the unconditioned state 
and each reinforced trial provides an 
opportunity for the element to become 
associated in all-or-none fashion with 
the correct response. When condi- 
tioning finally occurs, response prob- 
ability jumps from 1/N to 1. Discon- 
tinuous learning theories have been 
contrasted frequently with "response 
strength" theories which assume that 
an associative factor (habit strength 
in Hull’s theory, proportion of condi- 
tioned elements in Estes' linear model) 
increases in a cumulative manner with 
successive reinforcements. According 
to a strength theory, response prob- 
ability make take on a large (possibly 
infinite) number of values ranging 
from 1/N up to 1. It is frequently 
difficult to distinguish this theory from 
the one-element model since they pre- 
dict the same average learning curve 
over a group of subjects and items. 
Indeed, the present results on average 
errors could be fit about equally well 
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by the linear model. Since the dis- 
cussion has been oriented towards the 
one-element model, perhaps it would 
be appropriate to record a few extra 
results which favor this model. For 
these purposes, let us consider two 
sequential statistics for which the two 
models deliver qualitatively different 
predictions. 

The first statistic is the average 
number of errors (to perfect learning) 
following an error that occurs on Trial 
n of the experiment. The one-element 
model makes the rather counterintui- 
tive prediction that this average num- 
ber of subsequent errors is a constant 
independent of the trial number on 
which the leading error occurs. Thus, 
if we observe an error on Trial 10 we 
predict the same number of subsequent 
errors as if we had observed that error 
on Trial 1. The point of the matter is 
that when an error occurs on Trial n 
we know that the item was not condi- 
tioned before Trial n, and we can set 
the clock back to Trial 1 as far as the 
model is concerned in predicting future 
errors on that item. In contrast, the 
associative strength approach (speci- 
fically, the linear model) predicts that 
the number of errors following an 
error on Trial n is a decreasing func- 
tion of n; that is, the greater the num- 
ber of reinforced trials before a par- 
ticular error, the higher the associative 
Strength at that time, and hence the 
fewer the number of subsequent errors 
before perfect learning. 

To obtain a sizable sample on 
Which to test this critical point, data 
from 48 subjects in another 10-item 
Paired-associate experiment ° (with 
N = 2) were pooled with the 3-C and 
present experiment. 
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Fic. 3. Paired-associate data: average 
errors following an error that occurs on 
Trial n, for 121,2, ... 6. (Data points 
are filled dots; predictions of linear and 
one-element models are indicated.) 


results on constancy vs. monotone de- 
creasing aspects of errors following an 
error on Trial n. Using this pool of 
760 response sequences (10 items for 
each of 76 subjects), the distributions 
of the number of errors following an 
error occurring on Trial 1, on Trial 2, 

, on Trial 6 were obtained. The 
analysis was not carried beyond Trial 
6 since the number of cases involved 
was decreasing so that estimates of 
means would be more unreliable. The 
estimates of the average number of 
errors following an error on Trial n 
are shown in Figure 3. 

These data clearly favor the con- 
stancy prediction of the one-element 
model Although the estimates fluc- 
tuate somewhat this can be attributed 
to sampling variability; even the 
largest difference (2.05 vs. 1.95) does 
not approach statistical significance 
(t2049). Also in Figure 3 is 
shown the rough order of magnitude 
of the numbers to be expected from 
the linear model. These predicted 
numbers are not exact because of dif- 
ferent learning rates in the two ex- 
periments that were pooled. How- 


48 Gorpon H. Bower 


ever, the average c value was .25 and 
this value was used to obtain the 
values on the graph. Clearly the 
curve predicted by the linear model is 
quite discrepant from the data; ac- 
cordingly, the one-element model ap- 
pears to give a more adequate descrip- 
tion of these data. 

A second example of this constancy 
effect is shown in Figure 4 which pre- 
sents similar data collected in an ex- 
periment on verbal discrimination 
learning. In this experiment 34 sub- 
jects learned 20 items, an item con- 
sisting of two different nonsense syl- 
lables printed on a card. The subject’s 
task was to read off each card that syl- 
lable arbitrarily designated as correct 
for that item; the subject repeatedly 
went through the items in random 
order until he achieved two consecu- 
tive errorless trials with the entire list. 
Since response learning per se is not 


involved in this situation, the one-ele- 
ment model was expected to apply. 
Since there were two response alterna- 
tives on each card, we make the nat- 
ural assumption that the chance prob- 
ability that subject selects the correct 
syllable is one-half. Detailed com- 
parisons of predicted and observed 
statistics indicated excellent fit of the 
model; the results in Figure 4 are 
representative of the overall accuracy 
of the predictions for this set of data. 
An additional set of paired-associate 
learning data providing a comparison 
of the linear and one-element models 
has been reported in a previous paper 
(Bower, 1961); there again the one- 
element model was clearly superior to 
the linear model in predicting details 
of the data. 

A second measure suggested by 
Estes (1960) for differentiating the 
one-element and linear models con- 
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cerns the probabilities of repetition 
and alternation of responses over a 
series of test trials following training. 
Consider a miniature experiment con- 
sisting of one or two reinforced trials 
followed by two test trials (T, and 
T,) on which the subject is not in- 
formed whether his response is right 
or wrong. The comparison differ- 
entiating the two theories is the prob- 
ability of correct on T, given a correct 
and incorrect response, respectively, 
on T,. According to the one-element 
model, the probability of correct fol- 


lowing incorrect on T, should be 
around the chance guessing level 


(1/N) ; the probability of correct fol- 
lowing correct on T, will be much 
higher, being somewhat less than one 
because we may expect some of the 
Correct responses on T, to have oc- 
curred by guessing. The predictions 
of these conditional probabilities by 
the linear model depends upon 
whether there are substantial subject 
and/or item differences in learning 
rate. Tf for the moment we neglect 
such differences, as we have in testing 
the one-element model, then the linear 
model implies that the conditional 
Probabilities of success following 
either a success or failure on T, will 
be equal. 

Two experiments will be reported to 
this point. The first, performed with 
the assistance of Sharon Gadberry, 
consisted of 36 subjects learning two 
10-item lists (nonsense syllables), the 
responses being the integers 1 to 10. 
Following two presentations of each 
S-R pair, the subjects received two 
test trials with the stimulus member 
The pooled results of 
shown in the first 
Starting with 720 


of each pair. 
this experiment are 
row of Table 5. 

cases (36 subjects on two 10-item 
lists), the probability of correct on T; 
was .586; of those correct, 89.6% were 


TABLE 5 


ResuLts OF Two EXPERIMENTS ON THE 
PROBABILITY OF CORRECT ON T: 
FOLLOWING CORRECT AND n 
FAILURE ON T, 


" Per- Per- 
Number] 15. | centa; 
"— cent ge | centage 
Experiment E pone eee correct 
on Ti tere | «Sen 
correct | failure 
I. Nonsense 
syllables, 10 
number re- - 
sponses 720 | .586 .896 .163 
II. Noun-noun 
pair. 320 | .900 | .995 | .030 


correct on T,; of those incorrect on Ta 
only 16.3% were correct on Ts 
These results support the one-element 
model. Two comments are required 
here: First, the value of .896 for cor- 
rect following correct on T, is about 
what would be expected if 10% of 
the correct responses on T, had oc- 
curred by guessing; secondly, the 
value of .163 for correct following 
incorrect on T, is a little higher than 
the chance level of .100; however, this 
is likely an artifact of assigning stim- 
uli and responses in one-to-one corre- 
spondence so that when k items are 
learned the probability of guessing 
correctly on the unlearned items may 
be close to one in N — instead of the 
a priori value of one in N. 

The difference between probabilities 
following a success 
and failure on T, should be increased 
above those in the previous study if 
one requires the responses per se to be 
learned (e.g. as in word-word pairs). 
Under these circumstances, the chance 
level of guessing will be practically 
zero. Thus, if the subject fails to re- 
spond correctly on. T,, the probability 
of being correct on T, is essentially 
zero; by the same reasoning, if a cor- 
rect response occurs on T, it is most 
likely to have come about by condi- 


of success on T, 
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tioning so that the probability of cor- 
rect on T, for this item should be 
essentially unity if forgetting is neg- 
ligible in this situation. Row two of 
Table 5 reports the results of an ex- 
periment performed under these con- 
ditions with the assistance of Judith 
Slagter. Thirty-two subjects received 
three presentations on each of 10 noun- 
noun pairs (e.g., moon-pin) followed 
by two test trials. The percentage 
correct on T, was substantially higher 
than in the first experiment with non- 
sense syllable-number pairs. How- 
ever, the important points of interest 
are that the probability of correct on 
T, given a correct on T, is essentially 
unity, whereas the probability of cor- 
rect following failure on T, is nearly 
zero, 

One objection frequently raised to 
these foregoing comparisons is that if 
there are substantial subject and/or 
item differences in learning rates, then 
arguments based upon the pooled ag- 
gregate will tend to favor the one- 
element result. The point of the ob- 
jection is this: if the linear model 
holds, and if differences in learning 
rates lead to a distribution over sub- 
jects of response probabilities, then 
considering the entire aggregate the 
conditional probability of a success fol- 
lowing a success is expected to be 
higher than the conditional probability 
of success following a failure. This 
happens, according to the argument, 
because when we conditionalize upon 
the first success we are selecting pre- 
dominately those protocols from the 
upper end of the distribution of re- 
sponse probabilities. However, grant- 
ing that this argument is sound, it 
nevertheless helps not at all in res- 
cuing the linear model from such data; 
it simply shifts the focus of the argu- 
ment from conditional probabilities to 


joint probabilities of pairs of re- 
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sponses on T, and T,. For conveni- 
ence in the following, let p, represent 
the joint probability of a success on T, 
and a failure on T,, and let pọ, be the 
joint probability of failure on T, and 
success on T,. It is a simple matter 
to show that if the linear model holds, 
then 5, should equal p, regardless of 
possible differences between subjects 
and/or items in learning rates or initial 
probabilities. Data from four studies 
previously published by Estes (1960, 
1961) show that the required identity 
of py. and po, fails to appear in any of 
the four studies. For example, in one 
study (Estes, 1960, Figure 2, p. 215) 
the value of p,) was .132 while pọ, was 
046; in another study (Figure 6, p. 
218) py) was .098 while p,, was .003. 
A variety of ad hoc hypotheses about 
individual differences to supplement 
the linear model have been considered 
in a paper by Estes (1961) ; the out- 
come of those investigations was that 
no simple ad hoc supplements could 
bring the linear model into corre- 
spondence with these elementary data 
which were collected to test it directly. 

The question about individual dif- 
ferences raised by the objection above 
can be answered by comparison with 
the variability expected from the 
model. Suppose this question is con- 
sidered in the context of a more ex- 
tended experiment such as that de- 
scribed for the 3-C group in the pre- 
ceding pages. A summary measure of 
an individual's performance might be 
the total errors (T) he makes over the 
10 items. If we assume within the 
model that all subjects and items are 
characterized by the same learning 
rate constant, c, then a subject's T 
Score represents the sum of 10 values 
sampled randomly and independently 
from a distribution which allegedly is 
the same for all subjects and items. 
The theoretical variance of these T 


——— HM € t M 


ASSOCIATION MODEL For Parrep-AssociATE LEARNING 51 


scores may be calculated using our 
single estimate of c; this predicted 
variance may then be compared with 
the variance of the observed T scores 
by taking their ratio which will be 
distributed approximately as the F sta- 
tistic. If the observed variance of T 
scores is much larger than that pre- 
dicted, then we would tend to at- 
tribute the inflated variance to varia- 
tions in c over subjects. This test has 
been applied to several sets of paired- 
associate learning data collected by the 
writer, including the data from the 
3-C and 8-C groups of the present 
study for which the variance of T 
could be predicted. With the popula- 
tion of college students who have 
served as subjects in these various ex- 
periments, this test statistic has al- 
ways yielded nonsignificant F values; 
that is, the variance of T between sub- 
jects does not differ significantly from 
What would be expected on the basis 
of random sampling from the stochas- 
tic process assumed by the model to 
be common to all subjects. A similar 
Statement can be made about anal- 
ogous tests for differences in item 
difficulty ; this outcome was expected, 
of course, since precautions were 
taken in selecting stimuli which ap- 
Peared to be equal in intralist simi- 
larity and association value. Alterna- 
tive procedures for handling this ques- 
tion regarding subject and/or item 
differences ii the RIT expeument 
by Estes, 

Opkins, and Crothers (1960) 

In concluding this report, 
* appropriate to add a few general 
Comments concerning the issues under 
discussion lest some misunderstand- 
ings arise, First, it is misleading to 
cast the issues under discussion at the 
level of all-or-none theories versus 


Tesponse-strength theories of learning ; 
ather the point at issue is whether the 


appropriate model for the present ex- 
perimental situation is a two-state or 
a multistate process, where "state" 
here refers to a particular value of re- 
sponse probability. The continuous 
linear model is the limiting case of 
the class of multistate models. The 
all-or-none assumption about condi- 
tioning with respect to the available 
stimuli is not a differentiating feature 
of these two classes of models. Since 
the early writings of Guthrie (1930) 
it has been clear that the all-or-none 
conditioning assumption would imply 
gradual and cumulative changes in re- 
sponse probabilities provided there is 
sufficient variability in the stimulus 
samples from trial to trial Within 
the framework of statistical learning 
theory, the feature differentiating the 
two classes of models is how many 
independent stimulus components one 
must assume to represent adequately 
the course of learning in this or that 
experimental situation. The two-state 
model proposed here for elementary 
association learning assumes that each 
item may be represented by a single 
stimulus element within the model and 
that this element can be in either of 
two states. That the number of ele- 
ments is the critical assumption is 
illustrated by the fact that the statis- 
tics in Figures 3 and 4 and in Table 5 
would not discriminate qualitatively 
between the linear model and a two- 
element model (i.e., where each item 
is represented by two stimulus com- 
ponents, with a random one being 
sampled on each trial). For more ex- 
tensive discussions of small element 
models the reader may refer to a 
paper by Estes (1959) or a book by 
Suppes and Atkinson (1960). 
Secondly, it should be recognized 
that in this report we have demon- 
strated only that the one-element 


model adequately describes results 
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from elementary ^ paired-associate 
learning experiments. The job of ex- 
tending the range of applicability of a 
theory is an empirical project which 
must proceed piecemeal. In large 
part, the success in applying the model 
to a given learning situation will de- 
pend upon the simplicity of the situa- 
tion and the degree of experimental 
control over stimulus variables. To 
cite a pertinent illustration: in training 
a rat to shuttle in response to a buzzer 
to avoid shock in a Mowrer-Miller 
shuttle box, conditioning is typically 
a gradual, “multi-stage” process (cf. 
Mowrer, 1960, p. 36); however, if 
experimental conditions are drastically 
simplified and precautions are taken 
to eliminate possible sources of inter- 
fering responses, then avoidance con- 
ditioning is a one-trial, “two-state” 
process (Maatsch, 1959). An anal- 
ogous illustration for the eyelid condi- 
tioning situation has been reported by 
Voeks (1954). 

In contrast to those situations in 
which a two-state behavioral process 
may possibly be achieved through ex- 
perimental control, there is a large 
number of common learning situation 
for which such precise stimulus con- 
trol probably can not be achieved, and 
it is unlikely that a simple two-state 
model could apply. These situations 
may be characterized generally as ones 
in which exposure to discriminative 
stimuli is subject-controlled; these sit- 
uations range from those in which the 
critical cues are proprioceptive stimuli 
from the subject’s current behavior, as 
in motor skill learning, to situations in 
which selected components of a com- 
plex stimulus array control behavior 
through the mediation of an overt or 
implicit observing response, as in com- 
plex concept learning. One example 
of this latter class would be that of 


a rat learning a black-white discrimi- 


nation in a T maze; the stimuli effec- 
tive at the moment of choice are mul- 
tiple and vary from trial to trial de- 
pending upon the subject’s vicarious 
trial and error behavior. For such 
situations more elaborate conceptual 
apparatus (cf. Audley, 1960; Bower, 
1959; Spence, 1960) is required to 
analyze the interaction between ob- 


serving behavior and instrumental, 
goal-directed behavior. 
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Testing the null hypothesis, Ho, against 
alternatives, Hı, is well established and 
has a proper place in scientific research. 
However, this testing procedure, when 
it is routinely applied to comparing 
experimental outcomes with outcomes 
‘that are quantitatively predicted from a 
theoretical model, can have unintended 
results and bizarre implications. This 
paper first outlines three situations in 
which testing Ho has conventionally been 
done by psychologists. In terms of the 
probable intentions or strategy of the 
experimenter testing Ho turns out to be 
an appropriate tactic in the first situa- 
tion, but it is inadequate in the second 
situation, and it is self-defeating with 
curious implications in the last situation. 
Alternatives to this conventional pro- 
cedure are then presented along with the 
considerations which make the alterna- 
tives preferable to testing the usual Ho. 


THREE APPLICATIONS or Ho TESTING 


Probably the most common applica- 
tion of the tactic of testing Hy arises 
when the independent variable has pro- 
duced a sample difference or set of 
differences in the magnitude of the de- 


1The author is indebted to Arnold M. 
Binder of Indiana University whose argu- 
ments inspired him to make explicit some of 
the issues involved in using conventional 
analysis of variance procedures in testing the 
adequacy of a theoretical model. As this 
paper went through various revisions over a 
period of time the writer is correspondingly 
indebted to a number of supporting agencies: 
the Graduate Research Committee and the 
College of Letters and Science of the Uni- 
versity of Wisconsin, the National Science 
Foundation, and finally to the Department of 
Psychology of the University of California, 
Los Angeles, his host during final preparation 
of the manuscript. 


54 


pendent variable. Quantitative predic- 
tions of the size of the difference or 
differences are not available. The ex- 
perimenter wishes to know whether or 
not differences of the size obtained could 
have occurred by virtue of the operation 
of the innumerable nonexperimental 
factors conventionally designated as 
random. He sets up Ho that the differ- 
ences are zero; chooses a significance 
level, æ; determines the set of hypotheses 
alternative to Ho that he is willing to 
entertain, Hj; selects an appropriate test 
statistic, t, F, x?, U, T, or the like; and 
proceeds with the test. Rejection of Ho 
permits him to assert, with a precisely 
defined risk of being wrong, that the 
obtained differences were not the product 
of chance variation. Failure of the test 
to permit rejection of Ho, which, un- 
fortunately, is commonly termed ''ac- 
cepting" Ho, means that the obtained 
differences or greater ones would occur 
by chance with a probability greater than 
a, This situation is straightforward. 
The experimenter has limited aims. He 
has asked a simple question, and he has 
received a simple answer, subject only to 
those ambiguities which attend all ex- 
perimental and inductive inference. His 
tactics are admirably suited to his stra- 
tegic objective. 

Another common but less satisfactory 
instance of testing Ho arises when the 
results of pre-experimental matching or 
pretesting are to be evaluated. Here 
the experimenter has measured the 
dependent variable or some related vari- 
able before operation of the independent 
variable, and he devoutly hopes that 
the experimental and control groups are 
alike except for random differences. He 
is now relieved or chagrined, depending 
upon whether Hoy is "accepted" or 
“rejected” as a consequence of his test. 
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Even if Ho is accepted his relief is tem- 
pered by some uneasiness. He knows 
that he has not proved, and indeed can- 
not prove, that Ho is “true.” His 
tactics in testing Ho seem to be appro- 
priate to the impossible strategic aim of 
proving the truth of Ho. Certainly, if 
he had a more reasonable aim he has 
adopted inappropriate tactics. Utilizing 
these tactics, the best he can do is to 
beat a strategic retreat, and if He is 
accepted he can perhaps point out that 
he has used a very powerful test and that 
if there were real differences they were 
most likely very small. Although psy- 
chologists have never to my knowledge 
done so, he might be able to go one step 
further and point out that his testing 
procedure would reject Ho a given per- 
centage of the time, say, 90%, if the 
true" difference had been as little as, 
say, one-tenth of an SD. This sort of 
Statement of the power of a test is a 
commonplace in acceptance inspection 
(Grant, 1952, Ch. 13). 

With the advent of more detailed 
mathematical models in psychology (e.g. 
Bush, Abelson, & Hyman, 1956; Bush & 
Estes, 1959; Goldberg, 1958; Kemeny, 
Snell, & "Thompson, 1957) a new sta- 
tistical testing situation is arising more 
and more frequently. The specificity of 
the predictions and perhaps the whole 
philosophy behind model construction 
pose a different kind of statistical prob- 
lem than those faced by most psycho- 
logical investigators in the past. It 
seems obvious that as the use of models 

ecomes more widespread a greater 
number of investigators will face the 
Problem of evaluating the correspondence 
between empirical data points and pre- 
cise numerical predictions of these points. 
Unfortunately most of the procedures 
used to date in testing the adequacy of 
Such theoretical predictions set rather 
ad examples. Probably the least ade- 
quate of these procedures has been that 
In which an Hg of exact correspondence 
between theoretical and empirical points 
1s tested against Hi covering any dis- 
crepancy between predictions and ex- 
Perimental results. 

Most models predict a considerable 


DEPENDENT VARIABLE 


n Xy E" c" x 
INDEPENDENT VARIABLE 


Fic. 1. Idealized situation involving the 
test of a theoretical function, Y' = f(X). 
(Theoretical points, Y"; are represented by 
open circles; obtained means, Yi, are repre- 
sented by solid circles.) 


number of different aspects of the data, 
and some of these aspects are predicted 
with greater success than others (Bush & 
Estes, 1959, Chs. 14, 15, 17, 18). We 
shall restrict our discussion to the pre- 
diction of values along a curve which 
might be a learning curve. An idealized 
version of such a typical situation is 
presented in Figure 1. Here, the de- 
pendent variable, Y, is plotted on the 
vertical axis against the independent 
variable, X, on the horizontal axis. The 
theoretical model has led to an expression, 
y' = f(X), giving a set of k theoretical 
predictions, Y's, Ya, +++, Y's, The ex- 
periment has produced k empirical data 
points, a set of mean values, Yi Ys +", 
F., corresponding to the values of the 
independent variable that were investi- 
gated, namely, Xi Xs ***, X, Indi- 
vidual observations tend to form normal 
distributions about each of the Y; and 
these normal distributions tend to have 
equal o’s for all data points. In further 
discussion we shall assume that inac- `: 
curacies in the manipulation of the 
independent variable, X, can be ignored. 
The problem now is to investigate the 
goodness of fit of the Y'; to the Y; or 
the correspondence between the Y'; and 
the Y; 

The tactics oriented toward accepting 
Hy as corroborating the theory involve 
breaking down the jth individual obser- 
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vation from the general mean of all of the 
observations, as follows: 


F= Y = (Y5— Y) 
"og Fot =y [1] 


where Y;; is the jth observation in the ith 
normal distribution, and Y is the general 
mean of all observations. 


the total sum of squares may then be 
partitioned as follows: 


SStot = SSpev Est 
+ SSpev Theory T SStheory [2] 


where SSpev Est is the sum of squares asso- 
ciated with the variation of individual 
measures from their means, SSpev Theory is 
the sum of squares associated with the 
systematic departures of empirical data 
points from the theoretical points, and 
SSTheory is the sum of squares associated 
with departuresof the theoretical points from 
the general mean of the whole experiment. 


If we suppose that the linear model for 
the analysis of variance holds, then: 


Yi = p+ Tit Did ei [3] 


where u is the population mean for all Y;; 
over the specific values of the independent 
variable, X;; T; is the departure of the 
"true" theoretical value of Y"; from a; D; is 
the discrepancy of the "true" value of Y; 
from the true value of Y; and ej; is a 
random element from a normal distribution 
with a mean of zero and variance, c, 
for all i. 


For a fixed set of X; the Ts and Ds may 
be defined so that XT; = ZD; = 0. 
Under Ho each D; = 0. Under H; some 
D;# 0, and the variance of the D; 
cp';z*0. This last variance may be 
termed the true variance of the dis- 
crepancies from the theory over the 
particular set'of X; that was investigated. 

The foregoing is a conventional an- 
alysis of variance model, and the F ratio 
of the MSpevtheory divided by the 
MSpevEs provides an excellent and 
powerful test of Ho against Hı. The 
number of degrees of freedom for SSpey Est 
will be k(n — 1) where n is the number 
of observations per data point, and the 
degrees of freedom for SSpev Theory will 
be b — nr, where zz is the number of 


degrees of freedom lost in the process of 
fitting the model to the data. If this F 
is significant, we reject Ho, concluding 
that the discrepancies between the Y; 
and the Y"; are too great to be accounted 
for by the observed random variation in 
the experiment. In this conclusion we 
accept the 5% or 1% risk implied by 
our choice of o. 

Logical difficulties arise when F fails 
of significance. Hg remains tenable but 
is not proved to be correct. A tenable 
Ho provides some support for the theory 
but only in the negative sense of failing 
to provide evidence that the theory is 
faulty. To assert that accepting the Ho 
proves that the model provides a satis- 
factory fit to the data is an inaccurate 
and misleading statement. We may 
mean that we are satisfied, but others, 
especially proponents of other theories, 
will tend to regard our test as too lenient. 

Failure to reject Ho, instead of pro- 
ducing closure, leaves certain annoying 
ambiguities, but the tactics of testing 
this particular Ho imply a strategy that 
suffers from more serious defects that are 
readily apparent when the whole concep- 
tion of testing a theory is carefully con- 
sidered. To begin with, in view of our 
present psychological knowledge and the 
degree of refinement of available theo- 
retical models it seems certain that even 
the best and most useful theories are not 
perfect. This means, in terms of the 
analysis of variance model, there will be 
some nonzero D;'s. Ho, then, is never 
really "true." Its "acceptance," rather 
than "proving" the theory, merely indi- 
cates that in this instance the D;'s were 
too small to be demonstrated by the sensi- 
tivity of the experiment in question. 
The tactics of accepting Ho as proof and 
rejecting Ho as disproof of a theory lead 
to the anomalous results that a small- 
scale, insensitive experiment will most 
often be interpreted as favoring a theory, 
whereas a large-scale, sensitive experi- 
ment will usually yield results opposed to 
the theory ! 

Curiously enough, even rejection of Ho 
by means of a very stringent experi- 
mental test may be quite misleading as 
far as casting light on the adequacy of 
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the theory is concerned. If the D,’s are 
very small indeed the theoretical model 
may be a great improvement over any- 
thing else that is available and satis- 
factory for many purposes even though 
an extremely sensitive experiment were 
to reveal the nonzero D;'s. If our task, 
as scientists, were to test and accept or 
reject theories as they came off some 
assembly line the tactics of testing Ho 
could be made in a satisfactory manner 
simply by requiring that the test be 
sufficiently" stringent. In fact, our 
task and our intentions are usually 
different from testing products; what we 
are really up to resembles quality control 
rather than acceptance inspection, and 
statistical procedures suitable for the 
latter are rarely optimal for the former 
(Grant, 1952, Chs. 1, 13). 


HYPOTHESIS TESTING VERSUS 
STATISTICAL ESTIMATION 


_An analogy will make clear the rela- 
tion between testing tactics and the in- 
tention of the tester. Suppose that I 
wish to test a parachute; how should I 
go aboutit? How I should test depends 
upon my general intentions. If I want 
to sell the parachute and am testing it 
only to be able to claim that it has been 
tested, and I do not care what happens 
to the purchaser, then I should give the 
parachute a most lenient, nonanalytic 
test. If, however, I am testing the para- 
chiite £o ‘be sure of it for my own USe 
then I should subject it to a very strin- 
gent, nonanalytic test. But if I am in 
the competitive business of manufactur- 
ing and selling parachutes, then I should 
subject it to a searching, analytic test, 
designed to tell me as much as possible 
about the locus and cause of any failure 
in order that I may improve My product 
and gain a larger share of the parachute 
market. My contention is that the last 
situation is the one that is most analogous 
to that facing the theoretical scientist. 
He is not accepting or rejecting a finished 
theory; he is in the long-term business of 
constructing better versions of the theory. 
Progress depends upon improvement or 
Providing superior alternatives, and im- 
Provement will ordinarily depend upon 


knowing just how good the model is and 
exactly where it seems to need alteration. 
The large D;'s designate the next point 
of attack in the continuing project of 
refining the existing model. Therefore 
attention should be focused upon the 
various discrepancies between prediction 
and outcome instead of on the over-all 
adequacy of the model. 

In view of our long-term strategy of 
improving our theories, our statistical 
tactics can be greatly improved by 
shifting emphasis away from over-all 
hypothesis testing in the direction of 
statistical estimation. This always holds 
true when we are concerned with the 
actual size of one or more differences 
rather than simply in the existence of 
differences. For example, in the second 
instance of hypothesis testing cited at 
the beginning of this paper, where the 
investigator tests a pre-experimental 
difference, he would do better to obtain 
95% or 99% confidence interval for the 
pre-experimental difference. If the in- 
terval is small and includes zero, he 
(and any other moderately sophisticated 
person) knows immediately that he is 
on fairly safe ground; but if the interval 
is large, even though it includes zero, it is 
immediately apparent that the situation 
is more uncertain. In both instances Ho 
would have been accepted. 


TrsriNG A REVISED Ho 


Before turning to estimation pro- 
cedures that are useful in examining the 
correspondence between experimental 
outcomes and predictions from a mathe- 
matical model, I shall digress briefly to 
outline a statistical testing method 
which can legitimately be used in ap- 
praising the fit of a model to data as 
shown in Figure 1. Basically the sta- 
tistical argument in the proper test is 
reoriented so that rejection of Ho con- 
stitutes evidence favoring the theory. 
The new Ho is that the correlation be- 
tween the predicted values, Y'; and the 
obtained values, Y; is zero, after all 
correlation due to the fitting process has 
been eliminated. The alternative, Hi 
against which Ho is tested is that there 
is a correlation. greater than zero be- 


58 THEORETICAL Notes 


tween theoretical and empirical points. 
The four simple steps required to obtain 
the necessary F test are as follows: 


1. Calculate t: = Y'; — Y for all i. 
Zi = 02 

2. Calculate SScorrespondence = 
n(ZtY;/Xt?, where n is the number of 
observations upon which each Y; is based. 
Negative values of (2t;¥;) are treated as 
zero. 

3. Obtain MScorrespondence = SScorre- 
spondence/??r, Where np, the number of 
degrees of freedom involved in fitting 
the theoretical points to the empirical 
data, will ordinarily be the number of 
linearly independent fitting constants 
in the mathematical expression of the 
model. 

4. Divide M SCorrespondence by MSpey Est 
to give Fcorrespondence Which has np de- 
grees of freedom for its numerator and 
k(n — 1) degrees of freedom for its 
denominator, & being the number of F;. 
The test is one-tailed in the sense that 
negative values of Dt;¥; are treated as 
zero values, so that the probability 
values of the F distribution must be 
halved, an unusual procedure with F 
tests in analysis of variance. 


Following the above procedure, rejec- 
tion of Ho now means that there is more 
than random positive covariation be- 
tween predicted and obtained values of 
the dependent variable. 

This test is admirable in that it puts 
the burden of proof on the investigator, 
because a small-scale, insensitive experi- 
ment is unlikely to produce evidence 
favoring the model. Furthermore, if 
the model has any merit, the more sensi- 
tive the experiment, the more likely it 
is that a significant F, favoring the 


*In the unusual event where the general 
mean of the observations, Y, is not used asa 
fitting constant for Y^ = f(X), the t must be 
computed as deviations from the mean of all 
the Y^; Y'. The test will then be insensitive 
to discrepancies between Y" and Y, and the 
interpretation will be somewhat equivocal. 
A separate test of Ho that Y population equals 
Y’, is feasible, but here the experimenter is 
forced into the illicit posture of seeking to 


embrace Ho. 


theory, will be obtained. Actually, the 
test is extremely sensitive to virtue in 
the theory, and therefore in the case of a 
moderately successful model and a mod- 
erately sensitive experiment both this F 
and the one testing the significance of 
systematic deviations from the model 
(F = MSpev Theory/MSpev Est) will tend 
to be significant. This outcome is no 
anomaly; it merely indicates that the 
model predicts some but not all of the 
systematic variation in the data. In 
short, progress is being made, but im- 
provement is possible. The fact that 
Simultaneous significance of both Fs, 
indicating general success and specific 
failures of a model, should be a common- 
place points up the necessity of turning 
to methods of statistical estimation for a 
more adequate examination of the work- 
ings of a theoretical model. 


PRACTICAL EsTIMATION METHODS FOR 
INVESTIGATION OF MODELS 


As is true of statistical tests, each 
method of statistical estimation has its 
advantages and limitations. In the 
investigation of the adequacy of theo- 
retical curves in psychology there are 
reasons to believe that the simpler 
estimation methods have Practical ad- 
vantages over some of the more elegant 
procedures. To give a fairly complete 
view of the situation, methods of point 
and interval estimation of op? and of 
the individual D; will be described, and 
a brief evaluation of each method will 
be given. 

Estimating cp*. The variance of the 
discrepancies between the Y'; and the 
Y; condenses into a single number the 
adequacy of fit of the theoretical model. 
As such it is an excellent index for the 
evaluation of the model. The smaller 
the variance, ap’, the better the model, 
and vice versa. As an estimate of the 
size of the discrepancies one might ex- 
pect in future similar applications of the 
model, øp? is far more informative than 
any F test. Furthermore øp? is readily 
estimated in the case of homogeneity of 
the error variance, o. The expected 
values of the relevant mean squares are 
as follows: 
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Exp(MSpev theory) = oê + nov? [4] 
Exp(MSpey Est) = o4 L5] 


A maximum likelihood estimate of the 
variance of the discrepancies, ép? is then: 


Ên? = (MSpev theory — MSpev Est)/” [6] 


The accuracy of this estimator depends 
upon the number of degrees of freedom 
associated with SSpev Theory and SSpev Est- 
The latter rarely poses any practical 
Problem, but the former, in view of the 
Predilection of psychologists for mini- 
mizing the number of data points, is 
quite critical. This is readily apparent 
When interval estimation of cp is 
attempted. 

Bross (1950) givesa convenient method 
or accurate approximation of the fidu- 
cial interval for op2, and in this case the 

ducial and confidence intervals are 
essentially equal. The method will be 
outlined below for the 5% interval. 


1. Obtain ép? from Equation 6, above. 
(If the estimate is negative or Zero 
meaningful limits cannot be obtained.) 

2. Find: 


R zd 


L = Poor kin) 


F. Foss(k-nr. 9), | 


Foss(e-nr, kin—1)) 
Where; 


F = MSpev Theory/ MSDev Est 


Poss : in the 2.5% 
25(k—np, kai] is the entry 1n 

F table. omm & Hartley, 1954) for 
"uc bo np and m = kn — 1]; an 


PFosq s œ) is the entry form = k— "T 
and ny = œ. 
3. Find: 
1 


L = Fe Foescetnu. kn —— 


F- Foo5(k(n—tl. kn) — 1 
Fozs(o, k-nr) 
Where Fos cunt), k-nr) is the entry i 
<a 2.5% F table for nı = kn — 1] = 
(es k — np; Foto, knr) 5? the entry 
^ — œ, and n — bk — UT, o. 
í 4, The upper and lower Hüte e 
hen Ló p? and Lên’, respectively- i 


less than 15-20 data points these limits 
will be found to be uncomfortably wide, 
a fact to bear in mind when designing an 
experimental test of a theoretical model. 
For example, in Figure 1, with 6 data 
points and two degrees of freedom for 
curve fitting, the limits might plausibly 
be 0-40, whereas with 14 data points the 
limits might be 0-12. 

Aside from the considerable variability 
in the estimate of cp? which can be 
reduced by increasing the number of 
data points, there are two other impor- 
tant limitations to the use of estimates of 
the variance of the discrepancies in 
evaluating a model. First of all, the 
population value of op* is completely 
dependent upon the particular values of 
the independent variable, X, which are 
chosen for the test of the model. Choice 
of two different sets of Xs could well 
lead to two entirely different values of 
cp), and both of these values could be 
perfectly accurate. Secondly, although 
cp? gives an over-all index of the ade- 
quacy of the model being tested, it 
condenses so much information into one 
measure that it does not permit pin- 
pointing the especially large Dij's „50 
that they can be given proper attention 
in considering revision of the model. 

Estimating the Di. The individual D; 
may be estimated as points, orintervals 
may be established for the Dj, collec- 
tively or individually. As before, each 
method has its good points and its 
limitations. Mr : 

Point estimation of the individual Dj's 
consists simply in comparing the indi- 
vidual data points, the f; with the 
fitted curve. It is a crude method, but 
it has served well in the past and repre- 
sents the beginning of wisdom. „For 
example, in Figure 1, the model builder 
might well note that the first three data 
points lie below the curve and ask him- 
self if there 18 some special reason for 
He would also note that the great- 
cy occurs at Ys, where is 

: ing discrepancies are in the 
neigh rection. The weakness of this 
ample method lies in the absence of a 
criterion which will assist the investigator 
in deciding which discrepancies should be 


this. 
est discrepan 
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singled out for further attention and 
which may be disregarded because they 
are within the range of expected random 
variation. This defect is remedied by 
the interval estimation techniques. 

Probably the ideal method of interval 
estimation is that in which intervals are 
established for the whole curve in one 
operation by finding the 95% confidence 
band. The method takes the theoretical 
curve as a point of departure, and the 
result is a pair of curves above and below 
the theoretical curve, which will tend 
in the case of random variation to con- 
tain between them 95% of the data 
points. Points lying outside the band 
are immediately suspect; they are the 
most promising candidates for attention 
in the next version of the model. There 
are two practical difficulties with this 
method. First, homogeneity of the error 
variance, c^, over all the X; is required. 
And secondly, because errors in estima- 
tion of each fitting parameter must be 
taken into account, for all but the sim- 
plest curves (Cornell, 1956, pp. 184—186) 
the bands may be difficult? to obtain. 
Although the method is elegant, in 
practice it will rarely represent sufficient 
improvement over the final method, 
given below, to justify its use. 

The last method seems to me to be the 
most useful and most robust and most 
flexible method. It can be widely ap- 
plied, and the relative ease of applica- 
tion, coupled with its ability to dis- 
criminate between significant and ran- 
dom discrepancies make it superior to 
the other estimation methods. It also 
possesses the homely virtue of being 
readily understood. In contrast to the 
preceding method, this one takes as its 
point of departure the empirical means, 


5 A sufficient estimate of the error variance 
of each parameter must be available and inde- 
pendent of the estimates of all other param- 
eters or else the covariances of all parametric 
estimates must be found and the theoretical 
function must have continuous first partial 
derivatives with respect to the parameters in 
order that the confidence bands may be found 
in the asymptotic case (Rao, 1952, pp. 207- 
208). Where an asymptote is involved in the 
fitting of the theoretical function, satisfactory 


independent estimators can rarely be obtained. 


and consists, simply, in computing the 
95% confidence limits for each of the Y';. 
If there is homogeneity of variance, the 
error variance of each mean is taken 
simply as ¢2/n; in cases of suspected 
heterogeneity, each mean must have its 
own estimate of error variance. This 
will, of course, be the variance of the 
distribution of Y;; for each 7, divided by 
n. When these limits have been ob- 
tained, attention is directed to instances 
where the theoretical curve lies outside 
the limits. In some cases, the investi- 
gator might choose to establish the 80% 
or 90% limits in order to direct his 
attention to less drastic departures of 
the experimental results from the model. 
Choice of an optimum level for the limits 
is hard to establish on a general a priori 
basis, but it is likely that limits narrower 
than the traditional 95% will be found 
more useful than the broader limits. 
Simple as this method is, it is hard to 
improve upon in actual practice. In- 
stead of giving an almost meaningless 
over-all acceptance or rejection. of a 
model, it directs attention to specific 
defects, its functioning improves as the 
precision of the experimental test is 
improved, and the investigator can set 
the confidence coefficient so as to in- 
crease its sensitivity to defect at a cost 
of a fairly well-specified percentage of 
false positives or wild goose chases. A 
final and often crucial advantage is that 
the confidence intervals, based as they 
are upon the experimental means, can be 
obtained in cases where the form of the 
theoretical function does not permit 
satisfactory estimation of its parameters, 
and the analysis of variance and con- 
fidence bands methods cannot properly 
be applied. 


SUMMARY AND CONCLUSIONS 


In this paper I have attempted to show 
that the traditional procedure of testing 
a null hypothesis (Ho) of a zero difference 
or set of zero differences is quite appro- 
priate to the experimenter's intentions OF 
scientific strategy when he is unable to 
predict differences of a specified size. 
When theory or other circumstances per- 
mit the prediction of differences of 
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specified size, using these predictions as 
the valuesin Hais tactically inappropriate, 
frustrating and self-defeating. This is 
particularly true when a theoretical curve 
has been predicted, and Ho is framed in 
terms of zero discrepancies from the 
curve. If rejection of Ho is interpreted 
as evidence against the theory, and 
“acceptance” of Ho is interpreted as 
evidence favoring the theory, we find that 
the larger and more sensitive the experi- 
ment is, the more likely it will lead to 
results opposed to the theory; whereas 
the smaller and less sensitive the experi- 
ment, the more likely the results will 
favor the theory. Aside from this 
anomaly, which can be corrected by re- 
casting Ho in terms of a zero covariance 
between theoretical prediction and ex- 
perimental outcome, hypothesis testing 
as a statistical tactic in this case implies 
an acceptance-inspection strategy. AC- 
ceptance-inspection properly involves ex- 
amination of finished products with a 
view to accepting them if they are goo 
enough and rejecting them if they are 
shoddy enough. The theoretician is not 
a purchaser but rather he is a producer 
of goods in a competitive market so that 
his examination of his theory should be 
from the standpoint of quality control. 
His idealized intentions are to detect 
and correct defects, if possible, so that 
he can produce a more adequate, more 
general theoretical model. Because his 
ideal strategy is not to prove OF disprove 
a theory but rather to seek a better 
theory, his appropriate statistical tactics 
Should be those involving estimation 
rather than hypothesis testing: 
Examination of alternative techniques 
available for point or interval estimation 
Sf discrepancies between theoretical pre- 
dictions and experimental outcomes or 
the over-all variance of these discrepan- 
cies suggests strongly that estimation 0 
the confidence intervals for the means 


found along a theoretical curve is the 
most practical and most widely appli- 
cable general procedure. Other writers 
have recently emphasized the values of 
various estimation as opposed to hy- 
pothesis testing techniques (e.g., Bolles 
& Messick, 1958; Gaito, 1958; Savage, 
1957) and it is hoped that considerations 
pointed out by them and points raised in 
this paper will be helpful to investigators 
who are in the process of examining 
theoretical models which lead to specific 
numerical predictions of experimental 
outcomes. 
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The widespread use of Q sorts makes 
it important that certain difficulties in 
item selection be emphasized. In this 
author’s opinion, these difficulties are 
such as to suggest the discontinuance of 
the use of this technique for research 
purposes, unless corrective measures are 
devised. 

The Q sort is a technique derived from 
Q-methodology practice (Mowrer, 1953; 
Stephenson, 1955). In using this tech- 
nique, comparison is made between an- 
swers to a large number of “tests” (items) 
from different persons (usually judges), 
or from the same person from different 
points of view: “self, ideal self, self as 
seen by mother, etc.” An implicit, under- 
lying assumption is that the tests are 
not inherently correlated. Operationally 
speaking, if two judges agree that the 
quality “shy” is “very characteristic” of 
Person X, then their agreement about 
the qualities “bashful, modest, outgoing, 
extroverted, etc.” is, at least partially, 
predetermined by the inherent correla- 
tions between the items. Now this would 
be all right if all the items were about 
shyness or nonshyness, then agreement 
between judges would mean that they 
agreed on the shyness of X. Character- 
istically this is not the case. Items rela- 
tive to the broad range of personality 
traits are usually included in a Q sort, 
Under these circumstances the inclusion 
of items inherently related will lead to 
spurious correlations between Q sorts. 

This difficulty has been pointed to be- 
fore, but seemingly without appreciation 
of its seriousness. Stephenson (1955) 
wrote: "no set contained direct anto- 
nyms" (p. 257) suggesting that this was 


1The author expresses his thanks to 
Robert J. Wherry, Edwin N. Barker, and 
Shephard Liverant for their valuable com- 
ments on this paper. 


62 


something to be avoided. Later Stephen- 
son ? said: 


you are quite right about... the use of 
direct antonyms. Little is to be gained by 
using words like introversive and extro- 
versive in the same Q-sample, one of them 
is enough. 


Cronbach (1953) said, in discussing cri- 


teria for the selection of items for Q 
sorts: 


statements, while logically bearing on the 
same domain, should represent a large num- 
ber of continua. Correlating persons seems 
to have no advantage if we present items 
which all fall into one scale (p. 380). 


. Two types of statistical distortion are 
involved in this use of highly related 
items. The first is an inflation of the 
correlation coefficient, and the second is 
an inflation of the number of degrees of 
Íreedom. By using the total number of 
presumably unrelated items as the degrees 
of freedom, one markedly inflates the de- 
grees of freedom in testing the signifi- 
cance of the correlation if many of the 
items are, in fact, correlated. A rather 
obvious example would be using only 
items related to a single dimension. If 
30 items concerning a single trait were 
used to judge a person, the correlation 
between judges would be accurate and 
misleading at the same time. It would 
be accurate insofar as the judges' agree- 
ment about the trait, but it would be mis- 
leading about the person insofar as this 
is the only trait being considered. This 
1s pretty obvious; however, now let us 
intermingle these 30 monotrait items 
among 70 other items, these 100 items 
Supposedly representing the broad spec- 
trum of personality traits. Further, let 


? W. Stephenson, personal communication, 
1957. 
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us suppose that two judges have a zero 
correlation on these 70 other items, but 
they have a perfect, 1.00, correlation on 
the 30 monotrait items. The result of 
this intermingling would depend upon the 
placement of the 30, agreed upon items. 
The author devised a set of data to ex- 
amine this problem. Seventy items were 
randomly distributed into the 11-cate- 
gory, approximately normal distribution 
shown in Table 1 and then minor adjust- 
ments were made to obtain an exactly 
zero correlation. The 30 agreed upon 
items were then intermingled in three 
ways. The first way was to place them 
in the extreme categories of the com- 
bined 100-item distribution shown in 
Table 1; the second was to place them 
in the middle categories; and the third 
Was to distribute them proportionately. 
This procedure yields the correlations: 
0.76, 0.11, and 0.30. A correlation of 
0.26 for 100 items is significant at the 
0.01 level, Considering the monotrait 
items as really one item, this example 
shows two types of overestimation : the 
correlations found range between .11 and 
76 instead of from .00 to .06, and the 
degrees of freedom used are 98 (100-2) 
instead of 69 (71-2). 

Let us look at another frequent use of 
the Q sort: self vs. ideal-self comparison. 
Where the monotrait items are rated in 
the same direction on both self and ideal- 
Self Q sorts, the correlation will be over- 
estimated. Where the monotrait items 
are rated in the opposite directions, the 


Correlation will be underestimated. This 


distortion would be augmented if the 
experimenter were comparing Q sorts 
e mono- 


before and after therapy and if th 
trait items shifted their directions. For 
example, let us Suppose that before 
therapy a patient sees himself as shy but 
as ideally not shy, and that after therapy 
he sees himself as not shy both actually 
and ideally, If the Q sort had many 
“shyness” items in it, the difference be- 
tween the pretherapy and posttherapy 
Correlation would be doubly overesti- 
"hated since the pretherapy correlation 
Would have been underestimated and the 


TABLE 1 


lrEM DISTRIBUTIONS 


Number Categories 
of 
Items 
1/2/3]} 4] 5 | 6| 7| S [9|1011 
70 items |2|5] 5| 7| 7|18| 7| 7|5/5|2 
100 items | 3} 7| 7| 10} 10| 26| 10) 10} 7) 7|3 


posttherapy correlation would have been 
overestimated. 

It would seem that a possible hope for 
the comparison of persons, or of different 
concepts of a person, would be the use 
of items whose interrelatedness is known. 
Levy and Dugan (1960) have proposed a 
method which offers a possible system 
for determining the interrelatedness of 
items, An investigator adapting their 
design would ask the subjects, from the 
same population as his experimental sub- 
jects, to describe a photograph with his 
Q-sort items. Each subject would be 
given a different photograph. The items 
could then be intercorrelated and factor 
analysed into orthogonal components. In 
using the Q sort with his experimental 
subjects the investigator could then obtain 
factor scores and use these for his com- 
parisons. Since this is a somewhat 
laborious procedure it would seem to be 
desirable for investigators to come to 
some consensus about the items for two 
or three widely applicable Q sorts. 
Short of the availability of independent 
items or items with known dependencies, 
any kind of Q-sort study is subject to 
this distortion. 


SUMMARY 


It has been pointed out that due to the 
interrelatedness between items which 
constitute a Q sort, investigators obtain 
spurious results due to an overestimation 
of the correlation coefficient and of the 
degrees of freedom in their Q sort. The 
difficulty of determining the correct mag- 
nitude of the correlation coefficient and 
the correct number of degrees of freedom 
to be used indicates the desirability for 
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the development of alternate methods for 
Q-type studies. 
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Various measures in psychology use 

a set of response categories extending 

in two directions about some neutral 
point. Scores for several responses 

are summed or averaged to give a 
composite score. Examples would in- 
clude some of the methods of absolute 

» judgment in psychophysics, the Likert- 
type of attitude scale, Osgood's se- 
mantic differential, rating scales used 
in personality assessment. The ob- 
ject of measurement for the composite 
Scores may thus be some attribute of 
physical stimuli, of verbal concepts 


of other persons, Or of oneself. 
be distin- 


es 


Two components can s 
guished in the responses: (4) the di- 
rection from the neutral point, repre- 
f Senting the basic dichotomy—heavy 


Versus light, agreement versus disa- 


Sreement with the statement, etc. ; ( 


the degree or extremeness of the re- 
Sponse from the neutral point. For 


P This report represents à revision of a 
doctoral dissertation at Harvard University: 
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ouncil; the replication in the United States 
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example, in a six-point Likert scale the 
subject writes + 1, -2,*3;0r—1, 
— 2, — 3 to indicate slight, moderate, 
and strong agreement or slight, mod- 
erate, and strong disagreement. The 
basic dichotomy of direction is deter- 
mined by whether he agrees or dis- 
agrees (writes + oT —); extremeness 
by whether he indicates slight, mod- 
erate, or strong feelings (writes 1, 2, 
or 3). 

A neutral response category may or 
may not be provided—e.g., Likert 
scales traditionally permitted a neu- 
tral response (writing 0), but many 
recent Likert scales do not. 

When scores for several responses 
are summed or averaged to give a 
composite score, the two components 
become: the number (more. „generally, 
the proportion) of responses scored in 
each direction ; the mean extremeness 
of the responses in each direction. 
aper derives à method 


The present p: 
for estimating the relative contribution 


of these two components to composite 
scores, and applies the method to data 
from several Likert attitude scales. 

A possible reaction to the preceding 
might be to grant that the two com- 
described can logically be dis- 
d, but question whether this 
or desirable. The purpose 

extended scale is to in- 


ponents 
tinguishe 
is necessary 
of using an 
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crease the refinement of measurement 
over that given by dichotomous re- 
sponses. What then is the purpose of 
separating direction and extremeness 
components? The answer is that there 
are both theoretical and practical rea- 
sons for considering the components 
separately : 

1. The theoretical reason is that the 
distinction may be a psychological as 
well as a logical one. Ideally, the di- 
rection component would represent a 
qualitative aspect of the object of meas- 
urement, while the extremeness com- 
ponent would represent quantitative, 
intensity, aspects. In reality, both 
components are likely to represent, at 
least in part, response tendencies of the 
subjects rather than aspects of the 
object of measurement. Cronbach 
(1946; 1950, p. 21) distinguishes the 
response sets possibly involved in the 
two components as “bias” or “ac- 
quiescence” for the direction compo- 
nent, and “tendency to go to extremes” 
for the extremeness component. It 
seems desirable to examine these dif- 
ferent factors separately. For ex- 
ample, Cronbach (1950) argues 
strongly that the extremeness com- 


ponent may largely represent a re- 
sponse set: 


The usual argument for the more finely 
divided scale of judgment on each attitude 
item is that it is more reliable and that 
subjects prefer it. If the latter advantage 
is significant, the finer scale may be retained 
and scored dichotomously. The argument 
that the finer scale gives more reliability is 
not a sound one, since this is precisely what 
we would expect if all of the added reliable 
variance were response-set variance and 
had no relation to beliefs about the attitude- 
object in question (p. 22). 


2. 'The practical reason for examin- 
ing the two components separately 
concerns the justification for dichotom- 
ous scoring. Regardless of the the- 
oretical status of the two components, 
if the extremeness component is little 


represented in whatever the composite 
score measures, usc of the simpler 
scoring method based on the direction 
dichotomy would seem justified. 


DEVELOPMENT OF AN APPROXIMATION 
FOR THE VARIANCE OF COMPOSITE 
Scores IN Terms or Direction 
AND EXTREMENESS SCORES 


The development given below as- 
sumes that a neutral response category 
is not provided for. Provision for a 
neutral category would give a closely 
analogous development. 

The composite score used here rep- 
resents the average of scores for the 
Separate responses. An individual's 
Composite score can be expressed 
mathematically in terms of direction 
and extremeness scores: 


C=PX— (1—P) Y 
where: 

C is the composite score. 

P is the proportion of responses in the 
direction scored positively. 

X is the mean extremeness of responses 
in the direction scored positively. 

Y is the mean extremeness of responses 
in the direction Scored negatively. 


For example, a subject makes 10 
responses as follows: +2, +2, 4-2, 
#2, 43, +1, — il, — —2 —5. 
If the responses are scored as they 
stand, then P = .60, X 2200, Y = 
1.75, and C — 4 .50. 

Frequently, to avoid negative num- 
bers some constant is added to all 
Scores. "This simply raises the com- 
posite score and does not affect the 
development given below. 

he present problem now becomes: 


[1] 


- to what extent does variation in the 


composite scores (C) depend on di- 
Tection (P), and to what extent on 
extremeness (X and Y)? 


*If a neutral response category were 
allowed, then Equation 1 should be modified 
to replace 1 — P by Q — the proportion © 
responses in the direction scored negatively: 

e 


— — C HE 
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Ihe variance of a difference in this 
case can be written as: 


ao? = apx? + oT army 


—2 cov [PX, (1 — P)Y] [2] 


The terms on the right involve the 
variances and covariance of products. 
There seems to be no simple method 
of handling this situation. The method 
used here is to approximate the terms 
on the right side of Equation 2 by the 
delta method.t In the present case, 
the assumption is that the deviations 
of individual scores around the group 
means are relatively small compared 
to the means, so that higher powers 
of the deviations can be neglected. 

Let Mp, Mx, and My stand for 
the group means of P, X, and Y. 
These means act as constants as Te- 
gards variations between individual 
Scores, 

Let Ap, Ax, Ay stand for individual 
deviations around the group means: 


P=Mp+Ap 
X=Mx+ Ar 
Y = My + Ay 


These deviations are assumed to be 
small compared to the means. 

The method will be illustrated for 
the first term on the right of Equation 
2, opx?. 

By the definition of any variance: 


ors? = B[(PX)*] - IE DI 
Applying the delta method : 


PX = MpMy + APMx 
+ AxMp + ApAx [4] 


In taking expected values, the terms - 
der of A vanish, 


involving the first oF 
since they are deviations around à 
mean, with an expected value of zero. 


The expected value of apax is the 


Mosteller for first 


4I am 3 F. 
m indebted to ion leading from 


Working out the derivati 
Equation 2 to Equation 
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covariance (Ap, Ax). Hence: 
E(PX)-— MpMx + rasxyrex L5] 


In squaring Equation 4 the terms 
are collected according to the order 
of A involved: 

(PX)? = 

0 order of A: Mp*Mx* 

1 order of A: + 2ApMpMx* 

: + 2A«M. pMx 

2 order of A: + AP?M x? + Az? Mp? 

+ AApAx MpMx 

3 order of A: ) These terms will 

4 order of A: [be neglected. 


In taking expected values, the first 
order terms again vanish. The third 
and fourth order terms are assumed 
to be negligible. The result is: 


E[(PX)*] = Mp y + o? Ms* 
$ ox Mp? + ArasooroxMpMx [6] 


Substituting from Equations 5 and 
6 in Equation 3 gives: 
opx? = op My? + ox°M. p 
eb 2r.p.xyopoxM pM. x 
—rpxorox® [7] 


Equation 7 gives an approximation 
for the first term on the right of Equa- 
tion 2 in terms of the means, variances, 
and covariance of P and X. 

For the second term on the right of 
Equation 2, an analogous application 
of the delta method gives the approxi- 
mation : 
o^a-PY = op My? ED! cy*(1 ka Mp)* 

L 2rpxyorey(1 — Mr) My 
—ryorer Í 


For the third term on the right of 
Equation 2, the approximation isi 


cov [PX, (1 — p)Y] = — op*MxMy 
T rax yyexoyMp(1 — Mp) 
— r.p. xyopoxM pM y 
+ rop yyopoyM x (1 — Mp) 


trexrexorexcr [9] 
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Substituting in Equation 2 from the 
approximations 7, 8, and 9 and col- 
lecting terms gives as the overall ap- 
proximation : 


I Ia 
oc = op? (My + My)? + ox? Mp? 

II, 

+ oy?(1 — Mp)? 
The 

— 2r.x,v,0xoyM p(1 — Mp) 
III, 
+ 2r.p.x,opoxMp(Mx + My) 
Ill, 
— 2r.py,opcy(1 — Mp) (Mx + My) 

III, 

— [Ye.xorox + rip. y opoy]? [10] 

Roman numerals are written above 
the terms to indicate the grouping 
that is relevant for present purposes : 
Term I involves variation of P; Terms 
Ia, Il, and II, involve variation of 
X and Y; Terms III,, III, and IIT, 
involve the covariation of P with X 
and Y. 

(The final Term III, involves a 
higher order of the variables than the 
other terms, and will often be neg- 
ligible. The correlations, and the 
standard deviation of proportion scores 
are necessarily less than 1. The stand- 
ard deviations of X and y also tend 
to be less than 1 if the scale has no 
more than three steps in each direction 
—i.e., a scale of seven steps or less. In 
such cases the final Term III, tends to 
be negligible. For scales that extend 
further, the term might not be neg- 
ligible. ) 

The approximation given by Equa- 
tion 10 can be used to estimate the 
relative contribution of direction and 
extremeness scores to the composite 
scores on different types of bipolar 
scales. Comparison can be made of 
the three groups of terms, and in 


particular, the relative size of Term 
I to the combined Term II. 


APPLICATION TO DATA FROM SIX- 
Step LIKERT ATTITUDE SCALES 


This section applies the approxima- 
tion just developed to data from six- 
step Likert attitude scales. For each 
item, the subject chooses between 
three degrees of agreement by marking 
+1, +2, +3; or three degrees of 
disagreement by marking — 1, — 2, 
—3. Hence X and Y must lie be- 
tween 1 and 3, while the proportion 
scores P lie between 0 and 1. 

Before taking up actual data, it is 
Possible to get a general idea of the 
likely results. If one assumes for ex- 
ample that the means of P, X, and Y 
are near the middle of their possible 
values, then Mp will be around .5, 
and My and My around 2. In this 
case: 


Term I is about 16 op’, 
Term II, is about .25 ox’. 
Term II, is about 25 oy?. 


Term II, is about — Sr x.y oxoy: 


Then with the standard deviations 
still to be evaluated, Term I has an 
advantage of 64 to 1 over Term II, 
or Term II, If one assumes further 
that the o’s are likely to be roughly 
in the ratio of their possible ranges, 
then ex and cy would be about twice 
as large as op. This cuts down the 
advantage of Term I over Term II, 
or IT, to 16 to 1, or an advantage of 
8 to 1 over both combined. Insofar 
aS the correlation Tx, y; is positive, 
the combined Term II would be made 
even smaller by Term II.. 

While in principle, data from any 
set of items could be used, two points 
deserve consideration, 

l. If the items were such that sub- 
jects tended to respond in the same 
direction on nearly all items, then 


—— — —ÓÓÁ 
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TABLE 1 
Basic Dara FOR DIRECTION AND EXTREMENESS SconEs 


Scale 
Score F Dogmatism cibum General 
onservatism Agreement 
Us UK Us UK us UK US UK 
Direction: P 
Mp EU .50 A5 .50 .59 45 58 .62 
op 12 48 10 13 AL 46 .05 .06 
Rel. of P .56 .18 51 .67 .50 .84 45 .89 
Extremeness: X & Y 
Mx 179 | 181 | 1.74 | 175 | 215 197 | 201 | 1.96 
ox 34 31 .29 28 .35 .36 25 23 
Rel. of X .66 Er .62 53 .80 71 94 .88 
My 196 | 190 | 1.94 | 1.83 | 1.80 195 | 195 | L84 
oy .33 .36 82 33 33 33 29 27 
Rel. of Y .64 .70 71 67 .66 .64 .92 OL 
Correlations 
T(P, X) .08 .48 Al .22 .40 A1 | —.07 .00 
TIP, Y) —20 | —.24 | —-08 ‘or | —.12 | —38 | —14 | —16 
fix) yt: A2 46 AT 44 AT .89 .82 


either X or Y would be based on few 
items and would tend to be an tUm 
Stable measure. 

2. To give a representative test, a 
variety of types of scales should be 
"Used. In particular, the test should 
include both the recent type of scale 
deriving from The ‘Authoritarian Per- 
sonality where all items are worde 
in one direction and agreement is al- 
Ways scored positively, and the more 
traditional type of Likert scale where 
items are worded in both directions 
and agreement 1s sometimes scored 
Positively and sometimes negatively. 

Accordingly results are presented 
for four sets of items: 


items from the F 
' (Adorno, Fren- 
Sanford, 1950, 


l. Responses to 28 
RIS of “authoritarianism” 
x tunswik, Levinson, & 
if a subject re- 
direction, either 
ndeterminate- 


5 ICH] 
Sp In the limiting case, 
iie ds to all items in one Ct. 
X or his Y score becomes ! 


pp. 255-237, 260—omitting Items 22 and 
30/28 

2. Respon 
Dogmatism Scale, 


pp. 7-10) 
In both of these scales all items are 


worded in one direction and agreement is 
. always scored positively. Hence scores 
represent à mixture of attitudes toward 
Authoritarianism Or Dogmatism with an 
agreement response tendency (“acquiescent 
response set"). 

3. Responses 


ses to the 40 items of Rokeach's 
Form E (Rokeach, 1956, 


to 32 items referring to 
conservative Versus more leítist positions 
on economic and political issues. Sixteen 
of the statements present à conservative 
position and agreement is scored positively. 
The remaining 16 statements present the 
n the reverse direction (the two 


same issue ir : 1 
versions are given at different times) and 
agreement is scored negatively. Hence the 


combined score should cancel major effects 
of acquiescent response set and give a fairly 
pure measure of attitudes toward Conserva- 


tism. 


4. 


“General Agreement. 
originals and reversa 


items, scored for 
The items include 
Is of the F and Dog- 


Responses to 208 


” 
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TABLE 2 


CONTRIBUTION TO VARIANCE OF COMPOSITE SCORES pv DIRECTION 
AND EXTREMENESS SCORES 


Scaie 
I 
Teien irom F Dogmatism | pation "E MM 
US UK US UK US UK US UK 
I .193 439 | .138 213 .195 .398 .039 .056 
Ila .024 .025 .017 -019 -041 027 .021 .020 
Illy .032 .032 .031 .026 .018 033 .015 011 
Els —.025 |—.007 |—.021 |—.021 |—.024 |—.028 |—.032 |—.021 
IIT. O11 | .100 | .011 | .028 | .072 023 |—.004 | .000 
Illy, 031 | .057 | .010 |—.001 015 | .085 | .007 | .008 
II combined .031 .050 .027 024 .035 .032 O04 007 
III combined .042 457 .021 27 -086 .108 003 .008 
Total: I + 11 + III 265 | .646 | .186 | .265 | .316 | .538 | .o46 | .071 
Actual ec? 276 642 195 -268 300 585 048 .066 
% discrepancy 4% | 0.7% | 5% 1% 5% 8% 2% 7% 
% of Total: | | i 
I: Variance of P 73 68 75 81 | 62 74 85 19 
II: Variance of 
X&Y 12 8 14 9 11 6 10 10 
II: Covariance of P | 
with X & Y | 16 24 11 10 27 20 6 11 


matism items, as well as for 20 items from 
the Anti-Semitism scale (Adorno et al, 
1950, pp. 68-70) ; and both versions of the 
Conservatism items just mentioned. Agree- 
ment is scored positively for all items. 
Hence the combined score should cancel 
major effects of attitude content and give a 
measure of generalized "acquiescent response 
set." 

The subjects were 88 American and 75 
English engineering college students. Re- 
sults for the American sample are headed 
"US"; results for the English sample “UK.” 

Further details of procedure are given 


elsewhere (Peabody, 1961). 


Table 1 gives the means, standard 
deviations, reliabilities, and correla- 
tions for the direction and extremeness 
scores P, X, and Y. 


6 All reliabilities are for split halves sepa- 
rated by a temporal interval, usually 2 
weeks. 


Table 2 gives the evaluation of the 
terms of the approximation Equation 
10. (The final Term III, is omitted 
here since of negligible size.) 

The closeness of the approximation 
is given in Table 2 by the “% discrep- 
ancy" between the sum of the terms 
and the actual variance of the com- 
posite scores, and ranges from around 
176 to 896. This result is about what 
would be expected from this type of 
approximation. 

Although the results in Table 2 
differ in detail from the rough a priori 
estimate made of the relative impor- 
tance of Terms I and II, the final con- 
clusion is about the same. The aver- 
age result for the two samples on each 
set of items gives the following com- 
position of the variance of the com- 
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posite scores: 


, About 7095-8096 involves variation of 
direction scores— Term I 

About 10% involves variation of extreme- 
ness scores—Term II 

, About 1095-2095 involves covariation of 
direction scores with extremeness scores— 
Term III 


This is a ratio of 7-8 to 1 for the 
first two terms. 

Two main factors seem responsible 
for this result: 
_ 1. From the general form of Equa- 
tion 10 variation in one component is 
weighted by the second power of the 
mean(s) from the other component 
(ie, whether a person agrees OF dis- 
agrees with an item will usually have a 
larger effect on his composite score 
than the degree of his agreement or 
disagreement). As in the a priori 
estimate, these weights overwhelm- 
ingly favor the direction variance term 
and more than compensate for the 
smaller size of this variance itseli— 
so that Term I is considerably larger 
than any of the Terms Il. 

2. The correlation between extreme- 
ness scores in the positive and nega- 


negative numbers (strongly disagree). 
Hence Term II, cancels much of the 
effect of the extremeness component. 
This factor becomes particularly prom- 
inent for the General Agreement scores 
where the variance of direction scores 
is relatively small (presumably due to 
the balancing out of content), but the 
correlation between X and Y becomes 
very substantial. 

A simpler if less rigorous approach 
is to consider the more practical ques- 
tion: how good a substitute for what- 
ever is measured by the composite 
scores can be given by direction or by 
extremeness scores? Table 3 gives the 
relevant correlations together with the 
reliabilities of the composite scores. 

The correlations given in Table 3 
are representative of the values found 
for a wider variety of scales. The 
correlations of composite scores with 
direction scores tend to be around .90 
and above. The correlations with the 
extremeness scores are more variable 
and substantially lower. 

The results show fairly clearly that 
composite scores on these Likert at- 
titude items reflect primarily the direc- 
tion of responses, and only to a minor 


tive directions is always positive. ponse | 
Those who use large positive numbers extent their. extremeness. The prac- 
(strongly agree) tend to use large tical conclusion suggested is that there 
TABLE 3 
DATA FOR ComposITE SCORES 
Scale 
" (balanced) General 
Reliabilities and correlations F Dogmatism | Conservatism | Agreement 
us | UK | US ux | es | UK | US | UE 
.66 .86 a .66 
Reliability of C scores 9| 8 .62 | .68 
Correlations of C scores: 
with Direction scores ^ 95 97 93 06 87 07 06 95 
Te, P) SEa x 
with Extremeness scores— 15 56 46 i 52 46 03 33 
Fc, x) L4 |—.38 |-.38 as |—.19 |—.47 |-.18 |—-06 
Y(c, Y) | 
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is justification for scoring such items 
dichotomously according to the direc- 
tion of response. The resulting score 
will closely reflect whatever the com- 
posite score would. 

Further applications are necessary to 
determine whether these conclusions 
would extend to scales where a neu- 
tral category is permitted, to scales 
with more than six or seven steps, and 
to scales that do not concern attitudes. 
As regards more extended scales (e.g., 
of 9 or 11 steps) the reasoning used 
with regard to six-point scales would 
imply that the extremeness component 
might play a more significant role. 
However, in view of the prevalence in 
psychology of scales with no more 
than six or seven Steps, the present 
results may be widely applicable. 


Discussion 


Extremeness scores are quite reli- 
able if based on a large enough number 
of responses, and are to a considerable 
extent independent of direction scores. 
But although extremeness scores can 
be stable measures of something, they 
are largely unrepresented in the usual 
composite scores. Hence the hope that 
Likert scale scores for example should 
be “influenced by the degree to which 
subjects favor or oppose attitude state- 
ments” (Newcomb, 1950, p. 171) is 
largely unfulfilled in practice, 

Extremeness scores will tend not to 
have the same correlates as the com- 
posite scores, but could have correlates 
of their own. Only a few studies using 
Separate measures of extremeness give 
some evidence of what these correlates 
may be (e.g. Osgood, Suci, & Tan- 
nenbaum, 1957, pp. 226-236; Soueif, 
1958). Tn particular, the general con- 

clusion that extremeness is little re- 
flected in whatever the composite 
scores measure applies to the F Scale 
of “authoritarianism.” This fact is 
worth noting in view of the recurrent 


attempts to interpret extremeness as 
"intolerance of ambiguity," implying 
that extremeness ought to relate to 
composite scores on the F Scale— 
most recently Mogar (1960) uses ex- 
tremeness on the semantic differential 
as a criterion for composite F Scale 
Scores. 

What is the nature of extremeness 
scores themselves? Two possibilities 
have already been suggested : extreme- 
ness scores may represent actual dif- 
ferences in intensity (eg, of atti- 
tudes), or they may only represent 
differences in using the response scale 
(i.e., response set). Differences with- 
in an individual in the extremeness 
Of responses to different items pre- 
sumably represent primarily differ- 
ences in intensity, although even here 
the individual may change his usage 
of the response scale over time. 

On the other hand, as regards dif- 
ferences between individuals in aver- 
age extremeness, a different interpre- 
tation is suggested by the correlations 
between extremeness scores found in 
the study reported here. Extremeness 
seems to be a very general individual 
characteristic : correlations between ex- 
tremeness scores in both directions for 
all possible sets of items (including 
originals and reversals of the F, Dog- 
matism, Anti-Semitism, and Conserva- 
ism items) tend all to be positive, 
most of them substantially so. Thus, 
a person who shows relative extreme- 
ness in taking an “authoritarian” posi- 
tion when agreeing with F Scale orig- 
inals or disagreeing with reversals, 
tends also to show relative extreme- 
ness in taking an “anti-authoritarian” 
Position when disagreeing with F 
Scale originals or agreeing with re- 
versals, etc, It does not seem very 
plausible to assume that individual 
differences in the intensity of attitudes 
show such Systematic generality for 
responses in opposite directions, But 
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the size of the numbers subjects ac- 
tually write does show such generality. 
Hence differences in extremeness be- 
tween persons seem to represent pri- 
marily response sets characteristic of 
the individual—consistent tendencies 
to use extreme or moderate response 
categories. While some evidence sug- 
gests a secondary tendency for ex- 
tremeness scores to represent differ- 
ences in intensity,’ the primary factor 
—the very great generality of extreme- 
ness scores—seems better interpreted 
as response set. 


SUMMARY 


Scores on the bipolar scales in fre- 
quent use in psychology can be ana- 
lyzed into two components: the pro- 
portion of responses in each direction 
of the basic dichotomy, and the ex- 
tremeness of responses in cach direc- 
tion. An approximation is developed 
expressing variation in the usual com- 
posite scores in terms of variation in 
the two components of direction and 
extremeness, 

The approximation is applied to four 
sets of items from Likert attitude 
scales. The results indicate that com- 
posite scores reflect primarily the di- 
rection of responses, and only to a 
minor extent their extremeness. The 
Dractical implication is that there 15 
ns tend to be posi- 
and negative be- 


ble 1). This indi- 


_7 For example, correlatio! 
tive between P and X, 


tween P and Y (as in Table 1 pa 
Cates some tendency to use higher or ower 


Positions on the response scale as a whole, 
As would be expected if intensity were à 
actor. However, this tendency should lead 
to negative correlations between X and Y. 
and is in every case overcome by the 
Opposing tendency for extremeness scores 
to correlate positively. 


justification for scoring the items 
dichotomously according to the direc- 
tion of response. 

Since extremeness scores are reli- 
able and are largely unreflected in 
the usual composite scores, they may 
have quite different correlates of their 
own. Individual differences in aver- 
age extremeness show wide generality 
across different extremeness scores. 
This generality extends to responses in 
opposing directions—suggesting that 
the differences primarily represent re- 
sponse sets, and only to a secondary 
degree actual differences in intensity. 
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THE LATERAL HYPOTHALAMIC SYNDROME: 


RECOVERY OF FEEDING AND DRINKING AFTER 
LATERAL HYPOTHALAMIC LESIONS: 


PHILIP TEITELBAUM Ax» ALAN N. EPSTEIN? 


University of Pennsylvania 


The brain controls feeding behavior. 
If the base of the brain is damaged 
bilaterally in the ventromedial region 
of the hypothalamus, rats and other 
mammals overeat and become tre- 
mendously obese (Anand, Dua, & 
Schoenberg, 1955; Brobeck, Tepper- 
man, & Long, 1943; Wheatley, 1944), 
Damage in the lateral hypothalamus 
produces the opposite effect: the ani- 
mals do not eat and will die in the 
presence of food and water (Anand 
& Brobeck, 1951; Anand, Dua, & 
Schoenberg, 1955). Electrical stim- 
ulation through chronically implanted 
electrodes in the medial hypothalamus 
suppresses feeding, and stimulation in 
the lateral hypothalamus elicits it 
(Brügger, 1943; Larsson, 1954; Wy- 
rwicka & Dobrzecka, 1960). 

These dramatic and consistent phe- 
nomena suggest that a dual hypo- 
thalamic mechanism controls food in- 
take. Cessation of eating (satiety) is 
said to be controlled by a medial 
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The authors are grateful to Eliot Stellar 
for his generous advice and material sup- 
port throughout this work. Both he „and 
R. Duncan Luce made valuable criticisms 
of the manuscript. Malcolm M. Cohen and 
Jay J. Singer gave valuable technical as- 
sistance. 

2 Formerly at the Institute of Neurolog- 
ical Sciences, Medical School, University of 
Pennsylvania. Now at the Zoological Lab- 
oratories, Division of Biology at the uni- 


versity. 


74 


neural system whereas active feeding 
is maintained by a system in the lateral 
hypothalamus (Anand & Brobeck, 
1951). 

There have been two major de- 
velopments since the original descrip- 
tion of the lateral starvation phenom- 
enon by Anand and Brobeck in 1951 
which suggest a reinterpretation of the 
functions of the lateral hypothalamic 
system. First, not only do the ani- 
mals refuse to eat, but they also do not 
drink (Teitelbaum & Stellar, 1954). 
The adipsia was -emphasized by 
Montemurro and Stevenson (1957). 
They reported three rats with lateral 
hypothalamic lesions that did not drink 
water but did eat appreciable quanti- 
ties of a high-fat diet, maintaining 
their weight for as long as 55 days 
when tube-fed with 10% glucose solu- 
tion or water. Without hydration, 
the animals ate less and lost weight. 
This was the first demonstration that 
lateral hypothalamic lesions in the rat 
could produce adipsia without aphagia. 
The more recent finding that hyper- 
tonic saline injected directly into the 
lateral hypothalamus can elicit drink- 
ing as well as eating (Epstein, 1960a) 
is consistent with this. 

Second, the refusal to eat and drink 
produced by these lesions is not always 
permanent, and when recovery occurs, 
it follows a regular and orderly course. 
Teitelbaum and Stellar (1954), work- 
ing with large lateral hypothalamic 
lesions that produced both aphagia 
and adipsia, kept their rats alive by 
tube-feeding. Without maintenance 
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these animals would have starved to 
death, but when kept alive they even- 
tually began to eat and drink again. 
The animals first accepted wet and 
palatable foods and began to maintain 
their weight, then drank water, and 
finally, regulated their weight on 
ordinary dry food and water, and 
Seemed normal, 

We will show that lateral hypo- 
thalamic lesions produce a complex set 
of symptoms—a syndrome of deficits 
in drinking as well as in feeding be- 
havior from which the animals recover 
at different rates. In 35 rats (34 
females, 1 male), we have produced 
bilateral lateral hypothalamic lesions 
of varying sizes using a standard ster- 
Sotaxic technique (Stellar & Krause, 
1954), The coordinates used were: 
A 60, RL 2.0, and 8.0 down from the 
ural surface. By offering the ani- 
mals a variety of wet and palatable 
foods s in addition to the ordinary dry 

"rina laboratory chow pellets and 
Water, and by keeping them alive by 
tube-feeding if necessary, we have 
identified four clear-cut stages in the 
recovery from lateral hypothalamic 
sions. The stages always follow the 
Same sequence and are defined by 
Critica] behavioral events that occur 11 
Succession during the recovery proc- 
ess. Each of the stages has been 
analyzed experimentally in an attempt 
o identify the physiological deficits 
underlying it. 


7s ^ liquid diet was prepared by TEE 
of milliliters of whole egg; 125. milliliter 
e 50% sucrose solution; 240 milliliters 5 
YaPorated milk; 0.3 milliliters Poly-Vi- d 
oriltiple vitamin preparation of Mea 
nson Co, Evansville, Indiana) ; and 


milliliter iol à 
s Kaopectate (Upjohn es 
fm, Michigan). One part 10% forge 
pres? milliliters of diet was 
Servative, Milk chocolate T€ 


divided 
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osh was prepared by mixing Purma Lab 
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The animal whose record is graphed 
in Figure 1 had large symmetrical 
lesions which resulted in a long, slow 
recovery, thus making it possible to 
see the events in each stage quite 
clearly. 


STAGES IN RECOVERY 
Stage I: Aphagia and Adipsia 


The animals refuse all food, wet or 
dry, and they do not drink any water. 
Without tube-feeding, they will die. 
The animal of Figure 1 neither ate 
nor drank for 19 days, steadily lost 
weight, and had to be kept alive from 
the 7th day on by gastric intubation 
with a liquid diet. The behavior of 
such a rat is quite characteristic. It 
is generally inactive, but it is alert and 
can be easily roused to movement. It 
walks with a slightly hunched posture, 
carrying its head low to the ground. 
It typically ignores food and will 
actively push away food that is forced 

If such a rat gets wet food 
paws or face, it engages in 
s bilateral shaking of the front 
hich somewhat resembles the 
d face-grooming movements 
of the normal rat except that there is 
no grooming with the mouth. When 
its paws are wet, it wipes them re- 
peatedly on the floor of the cage, and 
if the face and mouth have food or 
water on them, it walks along and rubs 
its chin and the sides of its face on the 
cage floor. It does not groom itself. 
effectively, allowing its fur to become 

ompletely matted and caked with wet 

- d. It actively resists having milk 
ie id in its mouth by a medicine 
pure er, and it does not swallow the 
mE it is there, but rather allows 
m ut the side of the mouth. 

normal rat does not 

But it does en- 
paw-waving and 


age i 1 : 
oping, chin-rubbing, poor grooming, 
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their coats grow sleek as their weight 
rises, often to preoperative levels. 
Palatability is no longer essential for 
food intake. If these rats are trained 
to press a bar that allows a liquid diet 
to flow directly into the stomach 
through a chronically implanted gas- 
tric tube (Epstein, 1960b), they ingest 
normal amounts of food and maintain 
their weight throughout a 6-day test. 
'They regulate their caloric intake nor- 
mally even when the diet is diluted 
with water. When required to press 
the bar three times for each injection 
of food, they work sufficiently harder 
to maintain the intake. This regula- 
tion of food intake without stimulation 
of the oropharyngeal receptors has 
been demonstrated in three rats dur- 
ing this stage and is similar in all 
essential respects to that seen in nor- 
mal animals feeding themselves intra- 
gastrically (Epstein & Teitelbaum, in 
press). The animals press the bar be- 
cause they are hungry, and the taste 
of the food is no longer necessary to 
elicit feeding. 

The animals’ feeding behavior seems 
normal, but they do not drink water. 
However, they can be weaned from 
the liquid diet to a sweet nonnutritive 
solution. By mixing the liquid diet 
each day with an increasing propor- 
tion’ of the sweet fluid, they can be 
weaned first to a 10% weight by vol- 
ume (w/v) sucrose solution, and from 
this to a 0.5 or 0.2% w/v sodium 
saccharin solution. Thus, for the first 
time since the lateral hypothalamus was 
damaged, they are freely drinking 
nonnutritive fluid while still refusing 
dry food and water. As their weight 
begins to drop, they show the hy- 
peractivity of starvation, then they 
inevitably begin to eat the dry Purina 
diet and regain their weight. If the 
saccharin solution is then removed (as 
indicated in the hatched area labeled 
1 starting on the 49th postoperative 


day in Figure 1), they still refuse to 
drink water, their food intake drops to 
zcro in 2 or 3 days, they lose weight, 
and if prolonged sufficiently, will die. 
In 28 rats brought to this stage of 
recovery, all were weaned to sweet 
fluid and all ate dry food to maintain 
their weight in the process. For every 
animal, removal of the sweet fluid led 
to dehydration because it did not ac- 
cept water and, consequently, stopped 
eating. Restore the sweet fluid, how- 
ever, and it is accepted immediately 
(as seen in Figure 1). They eat dry 
food shortly thereafter and then re- 
gain their weight. Ín this stage, there- 
fore, the animals do not eat because 
they do not drink. 

The counter-experiment was per- 
formed on four rats. The saccharin 
solution was again removed, but 
water, its essential component, was 
supplied by hydrating the animals in- 
tragastrically with 2 to 3 milliliters 
every hour automatically through a 
chronically implanted stomach tube. 
As seen in Figure 1 in the hatched 
area labeled 2 starting on the 64th 
postoperative day, the animals con- 
tinue to eat dry food normally when 
hydrated, even though they no longer 
have saccharin to drink. Hydration 
either by drinking sweet solutions or 
by intragastric injections allows nor- 
mal feeding. It is clear, then, that in 
this stage of the lateral hypothalamic 
syndrome, the refusal to drink water 
produces dehydration which in turn 
prevents feeding, 

Does dehydration also account for 
the failure to eat in Stage I? Will 
these animals eat immediately after 
operation if they are hydrated ? 
Chronic gastric tubes were inserted in 
five animals before they were sub- 
jected to bilateral lateral hypothalamic 
lesions. From the time they recovered 
from the anesthetic, usually the day 
after operation, they were hydrated 
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Fic, 2. Food and water intake an 
lesions hydrated automatically from 
after a period of complete aphagia an 


with 2 to 3 milliliters of water auto- 
matically every hour. Figure 2 shows 
the results. All animals showed com- 
plete, immediate postoperative apha- 
gia, varying in duration from 2 to 10 
days. "Then they all displayed an- 
orexia rather than aphagia, eating 
some wet food but not enough to 
maintain their weight. Four of the 
five animals died. The fifth began 
to eat dry food on the 7th day and 
was able to maintain its weight. When 
hydration was stopped, it stopped 
eating and lost weight; it ate agam 
immediately after hydration was re- 
sumed. It had, therefore, recovered 
from its initial aphagia and anorexia 
and passed into Stage III, where it 
ate if, and only if, it was hydrated. 


Because all of the animals were 1n- 


itially aphagic and because four of the 
five animals died despite adequate hy- 
dration, we conclude that intragastric 
hydration initiated in the immediate 
postoperative period does not assure 
normal feeding (Morrison & Mayer, 


d body weight of five animals with lateral hypothalamic 


the time of operation. 
d anorexia despite hydration.) 


(Four of the animals died 


1957). The aphagia and anorexia of 
Stages I and II are not due just to 
dehydration. 

Why do the rats in Stage III after 
lateral hypothalamic lesions refuse tap 
or distilled water and yet readily ac- 
cept saccharin solutions? Do they 
drink the sweet saccharin solution be- 
cause they are thirsty or do they take 
it in the same way that they take a 
liquid diet—as a food? A normal ani- 
mal increases its intake of saccharin 
solution both when deprived of food 
(Carper & Polliard, 1953) and when 
dehydrated. Four normal rats, de- 
prived of food for 3 days, took little 
or no water but drank increasing 
amounts of saccharin (from two to 
four times as much as they did when 
feeding normally). Three normal rats, 
dehydrated by an intraperitoneal in- 
‘ection of 2 milliliters of 1 M NaCl 
per 100 grams body weight (Adolph, 
Barker, & Hoy, 1954), showed an 
average intake of 14 milliliters of 0.2% 
saccharin solution in 2 hours, whereas 
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three other normal rats drank 16 milli- 
liters of water when similarly dehy- 
drated. Thus a normal rat treats a 
saccharin solution as either a food or 
a fluid, depending upon its condition 
of deprivation. When hungry it cats 
a saccharin solution, and when thirsty 
it drinks the saccharin. 

What does a lateral hypothalamic 
animal do? When deprived of food 
for 3 days, four adipsic lateral hypo- 
thalamic rats increased their daily in- 
take of saccharin solution (from 20 to 
30056 more than when feeding freely). 
However, on repeated occasions, seven 
adipsic rats drank little or no saccharin 
solution in response to intraperitoneal 
hypertonic saline. There were 20 in- 
stances of complete lack of response 
for as long as 5 hours and five in- 
stances when they drank only 2 or 3 
milliliters. Even a large dose of salt 
that proved lethal failed to elicit in- 
creased fluid intake. Because lateral 
hypothalamic animals do not increase 
their saccharin intake when dehy- 
drated but do when deprived of food, 
they are, in effect, eating the sac- 
charin, not drinking it. They are 
hungry, not thirsty. Because they do 
not respond in this stage to levels of 
dehydration that produce vigorous 
drinking in normal animals, their re- 
fusal to drink water in all three initial 
stages of recovery may be due to a 
severe impairment of the central mech- 
anism for water intake regulation, 

The refusal to drink water may be 
permanent, as illustrated in Figure 1. 
The animal died 200 days after opera- 
tion, never again having drunk water, 
It sustained itself for most of that 
time by "eating" sweet fluid and eat- 
ing dry food. Three other animals 
showed permanent adipsia until they 
died after 267, 230, and 107 days, 
respectively. One animal, still living, 

has been adipsic for 485 days. 


Stage IV: Recovery 


Eventually, aíter passing through 
the stages described above, most rats 
with lateral hypothalamic lesions begin 
to drink water. Because they continue 
to eat food and maintain their weight 
now in the presence of water as the 
only fluid, they superficially appear 
normal and in a sense have recovered. 

llowever, more sensitive tests re- 
veal that severe deficits in water regu- 
lation still exist. Some animals, for 
example, still do not respond at all to 
dehydration by intraperitoneal hyper- 
tonic saline. This deficit has heen ob- 
served for months and may very well 
be permanent. Also, four of these 
animals when offered water for only 
2 hours each morning did not compen- 
sate by taking their daily requirement 
(approximately 25 milliliters) in that 
time as normal animals do. They took 
little or nothing, became progressively 
dehydrated, Stopped eating, and lost 
weight until they were nearly dead. If 
they drank at all, they did so shortly 
after the water was presented. They 
typically drank 1 or 2 milliliters 
(never more than 5 milliliters), moved 
to the back of the cage and slept. 
When returned to 24-hour access to 
water, they again managed in an ap- 
parently normal manner 

There is a curious paradox here. 
These animals do drink when the 
water is available overnight but do not 
drink in response to rapid dehydra- 
tion with hypertonic salt nor do they 
drink enough when water is available 
for 2 hours in the morning. The 
reason for this apparent contradiction 
was discovered by observing what the 
animals were doing when they drank. 
By attaching a drinkometer (Stellar & 
Hill, 1952) to the water tube and re- 
cording the pattern of water ingestion. 
we found that these animals drank 
nothing during the day, but 


drank in 
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short bursts of rapid licking almost 
continuously throughout the entire 
night. This is when they normally eat. 
In addition, if offered both food and 
water simultaneously at night, they 
began to drink only after they had 
begun to eat. Shortly after eating a 
little dry food, they drank a little 
water, ate more dry food, drank more 
water, and so on throughout the night. 
One of the animals showed this be- 
havior in its most extreme form. It 
would stand near the drinking tube 
holding a pellet in its front paws, 
munch on the pellet for a few minutes, 
lick actively at the water, and then go 
back to chewing on the pellet. Its 
eating was extremely inefficient. It 
would bffe large chunks off each 
pellet, chew them a little and then drop 
them while it drank a small quantity 
of water. In the course of the night 
it would waste as much if not more 
food than it actually ate by dropping 
small chunks of it ‘through the mesh 
floor of its cage. Its drinking was 
also abnormal. Only small amounts 
of water were drunk during bursts of 
active and precise licking. Although 
25 to 30 milliliters of water were con- 
sumed, the cumulative record showed 
continuous licking throughout the 
night, interrupted by pauses for short 
bouts of eating. The animal licked all 
night to consume an amount of watet 
that a normal rat could drink in sev- 
eral sustained draughts. It was as if 
the animal were drinking not to 
quench its thirst but simply to wet 


its mouth. 
These 
such an animal drin 
eats. To demonstrate : 
prived of water for 24 hours and of- 
fered access to it for 2 hours during 
the night (8 to 10 P.M.) when it t 
mally ate. It drank 22 milliliters : 
water while eating 12 grams of ue 
The amount of water deprivation made 


observations suggest that 
ks only while it 
this, it was de- 


no difference. It drank only in pro- 
portion to the amount it ate. After 
48 hours of water deprivation it drank 
22 milliliters of water while eating 11 
grams of food. When the experiment 
was repeated in the absence of food so 
the animal could not eat, it again did 
not drink any water in 2 hours despite 
as much as 48 hours of water depriva- 
tion. As soon as the food was re- 
placed, the animal started eating and 
only after eating for a while did it 
start to drink. The same result was 
obtained in all four animals that had 
previously failed to drink adequately 
in the morning. The animals were 
then trained to shift their eating into 
the morning hours (10 a.m. to 12 
noon) by offering them food only at 
that time. Now that they were eating 
in the morning, they drank enough 
water in 2 hours to maintain their 
weight on this regime for several days. 
Because the animals drink only while 
they eat, we call this phenomenon 
“prandial drinking.” 
In the same way that the anorexic 
animals seen in Stage II are eating 
without caloric regulation, so in Stage 
IV, when the animals first begin to 
drink water, they are drinking with- 
out response to bodily dehydration. 
Just as palatability was sufficient to 
arouse appetite for food, so the mouth 
factors involved in eating dry food 
may be sufficient to arouse prandial 
rinking. 
i These animals are greatly affected 
by slight changes in the taste of their 
water. Normal rats will drink bitter 
quinine hydrochloride solutions, up to 
a concentration of 196 w/v, if they are 
the only fluids available. When in- 
gested at this concentration, 15 to 20 
milliliters of quinine hydrochloride is 
lethal to a rat. A normal rat, there- 
fore, will drink quinine-treated water 
until it becomes a poison. As this 
point is approached, they drink less 
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and less, gradually stop eating, and 
die. Lateral hypothalamic rats that 
have been drinking water reject qui- 
nine hydrochloride solutions at a con- 
centration as low as 0.005%. They 
refuse it day after day, stop eating, 
lose weight, and die. But when they 
are hydrated intragastrically with it, 
they thrive. Thus, the refusal to eat 
and drink until death seen in the 
earlier stages of the lateral hypotha- 
lamic syndrome can be reinstated in 
apparently recovered animals merely 
by making their water slightly bitter. 

A normal rat, as it becomes thirstier, 

overcomes the barrier to ingestion im- 
posed by the bitter taste of the quinine 
up to the lethal concentration and 
drinks enough to stay alive, But a rat 
with lateral hypothalamic damage does 
not. Even though ít drinks normal 
amounts and maintains itself when 
water is constantly available, the 
lateral hypothalamic animal fails to 
overcome the quinine taste barrier. In 
addition, it fails to drink adequate 
quantities when access to water is 
limited despite prolonged dehydration, 
and fails to respond to intraperitoneal 
saline challenge. These deficits in mo- 
tivation and in water regulation imply 
that the central neural thirst mechan- 
ism is still severely impaired. The 
powerful urge to drink is still absent. 
These animals drink only while they 
eat, not in response to dehydration. 
There is a striking resemblance here 
to Adolph’s (1957) description of the 
ontogeny of food and water regulation. 
The infant rat develops hunger before 
thirst and, in the neonatal period, does 
not respond to dehydration. It is 
adipsic but not aphagic. 

Although, accordihg to many meas- 
ures, food intake has been normal since 
early in Stage III, more sensitive 
tests again reveal continuing deficits. 
Here, too, taste seems to be an im- 
portant factor. Even though they are 


apparently eating normally, as little 
as 0.05% quinine hydrochloride w/v, 
mixed thoroughly in dry Purina 
powder, is often sufficient to cause 
rats with lateral hypothalamic lesions 
to refuse food completely to the point 
of extreme starvation. Normal rats 
readily accept food with up to 1.25% 
quinine, at which point they begin to 
decrease their intake and lose weight. 
When faced with the bitter taste of 
quinine, rats with lateral hypothalamic 
lesions, therefore, display a greater 
aversion to a negative taste in food. 
Palatability is again a powerful deter- 
minant of food intake in these animals. 
In the anorexic stage (Stage II), only 
highly palatable foods induced them 
to eat. And in the recovery stage, a 
slight negative taste prevents them 
from eating. This resembles the fin- 
icky eating observed in obese hyper- 
phagic rats after more medial hypo- 
thalamic lesions (Teitelbaum, 1955). 
This finickiness implies either an im- 
paired motivation for food (Miller, 
Bailey, & Stevenson, 1950; Teitel- 
baum, 1957) or an increased reactivity 
to its taste; presumably it is present 
in its most extreme form in the early 
stages of the lateral hypothalamic 
syndrome. 

The recovery process continues even 
after the animal has started drinking 
water again. Some rats become more 
responsive to intraperitoneal injection 
of hypertonic saline and indeed three 
of seven rats tested eventually re- 
sponded as well as a normal animal. 
Similarly in these same three animals 
it required as much as 0.196 quinine 
to prevent them from drinking whereas 
earlier they had been stopped by 4 
0.005 or 0.0196 quinine solution. 
gradual recovery can also be seen in 
the gross regulation of water intake, 
food intake, and weight. Immediately 
after entering the recovery stage, an 
for some time thereafter, some rats 
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Fic. 3. Stages of recovery seen in the 
behavioral events which 


with lateral hypothalamic lesions drink 
water in less than normal amounts, 
eat correspondingly less, and so main- 
tain their weights at a subnormal level 
(210 to 220 grams). This hypodipsia 
may last for months, but usually it 
disappears. Such a rat begins to drink 
and eat more, and it regulates its 
weight at 270 to 280 grams, a more 
normal weight for its age. 

The four stages of recovery from 
lateral hypothalamic lesions are sum- 
marized in Figure 3, which lists the 
Critical behavioral events whose pres- 
ence or absence defines those stages 
First, there is a stage of "aphagia and 
adipsia” in which the animal com- 


pletely refuses to eat or drink; second, 
i n B . . . 
‘anorexia and adipsia” in which it 

intain its 


eats, but not enough to mar 

weight, and does not drink; third, 
"adipsia and dehydration-aphagia," in 
in which the animal fails to eat only 


because it does not drink; and finally 
in which the 


the "recovery" stage 

animal drinks water, eats dry food, 
and will survive, but still shows cer- 
lain deficits in eating and drinking 


lateral hypothalamic syndrome. 
define the stages are listed on the left.) 


SUR 


YES 


NES 


NES 


(The critical 


behavior. It must be emphasized that 
these stages in recovery will not be 
observed unless the animal is offered 
wet and palatable foods from the time 
of the operation. If only water and 
dry food are available, the rat will die 
unless it rapidly reaches Stage IV, at 
which point it accepts water and can, 
therefore, eat dry food. 


ANATOMICAL ANALYSIS 


The duration of these stages is vari- 
able and seems to depend on both the 
locus of the lesions and the amount of 
tissue damaged. The lesions in 13 
animals were studied microscopically 
using thionin-stained celloidin sections 
through the diencephalon. Tissue in 
the lateral hypothalamic areas was 
destroyed bilaterally in all animals. 
Typically, the lesions were large, ex- 
tending medially to the mammillo- 
thalamic tracts and the fornices (the 
itself was only rarely de- 

and laterally into the medial 
f the internal capsules and 
Damage was greatest at 


fornix 

stroyed) 
portions o 
optic tracts. 
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the level of the ventromedial nuclei. 
The lesions extended anteriorly to the 
level of the supra-optic nuclei and 
posteriorly into the pre-mammillary 
regions. Dorsally, the overlying sub- 
thalamus was always involved. In 


the majority of animals, the most 
ventromedial portions of the lateral 
hypothalamic areas were spared. 
Figure 4 shows a frontal section 
showing lesions in the lateral hypo- 
thalamus at the level of the ventro- 


NORMAL 


E: 


dhe bopel showing edom io Bye aaan elk pt fe beset region c 
: à ! with interesting variations of the lateral 
hypothalamic syndrome. (Damaged tissue is outlined in black As Lar etrical 
z " CE . : Large, symmetrica 
lesions producing permanent adipsia. Recovery graphed in Figure 1. B: Transient aphagia 
and anorexia, despite immediate postoperative hydration, followed by recovery of food and 
water intake. Animal 3 in Figure 2. C: Died on sixth postoperative day despite adequate 
hydration. Showed aphagia and anorexia. Animal 5 in Figure 2. D: Asymmetrical lesions 
producing transient adipsia without aphagia. E: Massive lesions involving both medial 
and lateral tissue producing hypothalamic hyperphagia and permanent adipsia. 

Normal animal: Arc, arcuate nucleus; DM, dorsomedial nucleus; Fx, fornix; Hab 
habenula; IC, internal capsule; Inf. infundibulum; LHA, lateral hypothalamic area; MT, 
mammillothalamic tract; OT, optic tract; Rh, rhinencephalon; VM, ventromedial nucleus: 
VT, ventral thalamus; ZI, zona incerta; III V, third ventricle.) 
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medial nuclei in five animals who 
exhibited interesting variations of the 
lateral hypothalamic syndrome. The 
extent of damage has been outlined 
heavily in black. With the very large 
lateral hypothalamic lesions of Section 
A, each stage in the recovery process 
lasted for weeks or months. This is 
the animal whose recovery is shown 
graphically in Figure 1. This rat 
showed prolonged aphagia and ano- 
rexia (38 days), and it never drank 
water again. Such an animal, because 
of the dehydration-aphagia that is 
present when it refuses to drink water, 
would seem to be permanently aphagic 
on a dry diet. It was only by induc- 
ing it to accept a palatable, wet diet 
that the stage of pure adipsia could 
be demonstrated. With smaller or 
asymmetrical lesions, as in Sections 
B, C, and D, the entire sequence passes 
very quickly, and the animal may 
begin the recovery not at Stage I, but 
at any of the later stages. These ani- 
mals show aphagia and anorexia for 
only a few days or a week if offered 
a palatable liquid diet, and recover 
water intake within 1 or 2 months. 
Section B is from the brain of animal 
3 in Figure 2. It refused to eat for 
3 days despite automatic hydration. 
then nibbled wet food for 4 days, but 
remained adipsic for 1 month after 
Operation. Section C shows the lesions 
of Animal 5 in Figure 2. lt was 
aphagic and anorexic until it died even 
though hydrated. No attempt was 
made to keep this animal alive by tube- 
feeding, but since it had already begun 
to eat appreciable amounts of wet food. 


it probably would have regulated i 
caloric intake within a few more Cay” 
pothalamic 


Section D shows lateral hy l 
lesions which produced only “oe 
Without any aphagia or anorexia. i 
animal accepted wet mash or liquid 
diet immediately and regulated its 
Weight, but it refused to drink water 


and showed dehydration-aphagia on a 
dry diet. This effect closely re- 
sembles the earlier findings of adipsia 
without aphagia in the rat by Monte- 
murro and Stevenson (1957) and in 
the dog by Andersson and McCann 
(1956). Finally, Section E, at a 
slightly more rostral level, shows a 
huge lesion that involves the entire 
hypothalamus except for the infundi- 
bulum, destroying both the ventro- 
medial nuclei and the lateral hypo- 
thalamic areas, and extending laterally 
well into the internal capsules. This 
animal was aphagic and anorexic for 
only 4 days, then regulated its intake 
of wet food. It refused to drink water 
until it died 107 days later. When 
offered a wet diet it overate and be- 
came obese; but every time it was 
offered a dry diet and water, it re- 
fused to drink, stopped eating as a 
result of dehydration, and lost weight 
precipitously. This verifies the earlier 
findings of Williams and Teitelbaum 
(1959), who showed that hypotha- 
lamic hyperphagia can coexist with 
the lateral hypothalamic syndrome. It 
cannot be manifested in Stages I or II 
during which caloric regulation is not 
seen, but in Stage III, when caloric 
regulation appears, so does hypotha- 
lamic hyperphagia. But hydration is 
necessary. With a dry diet and water 
the only fluid available, the animal 
does not drink, and hypothalamic hy- 
perphagia is masked by adipsia. In 
addition, such animals make it clear 
that damage to the ventromedial re- 
gions of the hypothalamus can produce 
hyperphagia even when the lateral hyv- 
pothalamic areas are largely destroyed. 

In general, the severity of the def- 
icits was correlated with the size of 
the lesions. Transient deficits and 
rapid recovery followed small and 
asymmetrical lesions, whereas large 
lesions permitted only slow and in- 
complete recovery through stages of 
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long duration. This suggested that 
the recovery of function was mediated 
by the undamaged tissue around the 
lesions. Additional lesions in the tis- 
sue immediately adjacent to the origi- 
nal damage were made in six recovered 
animals. In all of them, additional 
lesions were found to reinstate the 
syndrome. After appropriate lesions 
the reoperated animals became aphagic, 
anorexic, and adipsic all over again, 
and the recovery followed the same 
general pattern. The phenomenon was 
particularly clear in one animal. The 
recovery process was repeated four 
times. Each successive recovery, in 
this animal, was slower. The fourth 
recovery proceeded only to anorexia 
and was terminated by a fatal gastric 
hemorrhage. Other animals have been 
rendered permanently anorexic by ad- 
ditional lesions, in one for as long as 
270 days. Therefore, the undamaged 
tissue surrounding the original lesion 
participates in the recovery. In ad- 
dition, this result suggests that the 
experience gained in the initial re- 
covery from lateral hypothalamic le- 
sions did not protect the animal from 
the effects of additional damage. 

Since the lateral hypothalamic syn- 
drome in rats is the result of a com- 
plex interaction of deficits in both 
feeding and drinking, the lateral hy- 
pothalamus must contain tissues con- 
cerned with both functions. The struc- 
tures involved could be cells within 
the lateral hypothalamus or fibers of 
passage from other areas. Morgane 
(1961a) reports a syndrome of apha- 
gia and adipsia in the rat with lesions 
in the apex of the globus pallidus, 
completely outside the lateral hypo- 
thalamus. He points out that the 
major efferent pathways from the 
globus pallidus (ansa and fasciculus 
lenticularis) are usually interrupted by 
lateral hypothalamic lesions. Thus, 
extrahypothalamic structures partici- 


pate in the regulation of food and 
water intake. 

Whatever the nature of the neuro- 
anatomical systems involved in feeding 
and drinking, some parts of them must 
overlap in the lateral hypothalamus. 
Is it possible to separate them? A 
chemical separation of the systems has 
been demonstrated in the rat by Gross- 
man (1960). He showed that from 
the same site within the perifornical 
region of the hypothalamus, chol- 
inergic drugs elicit drinking and ad- 
renergic drugs produce feeding. This 
implies that the functions of each of 
the systems may be dependent upon 
different neurohumoral agents. 

An anatomical separation may be 
possible because small, asymmetrical 
lesions produce adipsia without apha- 
gia and pure effects have been pro- 
duced by electrical stimulation. Drink- 
ing without feeding has been observed 
in the goat and rat (Andersson & Mc- 
Cann, 1955-56; Greer, 1955), as well 
as feeding without drinking (Larsson, 
1954; Miller, 1957; Smith, 1956). 
Thus, it is quite possible that two 
systems which do not entirely overlap, 
one concerned with drinking and the 
other with feeding, merge in the lateral 
hypothalamus, and so destruction 
there yields the combination of def- 
icits in feeding and drinking behavior 
that makes up the lateral hypothalamic 
syndrome, 

Morgane (1961b) has suggested a 
further subdivision of the lateral hypo- 
thalamic feeding system. He finds 
that rats with lesions in what he has 
called the “far-lateral” region of the 
lateral hypothalamus (centered just 
medial to the internal capsule) lose 
weight more rapidly while suffering 
aphagia and adipsia and are less likely 
to recover than animals with lesions 
in the “mid-lateral” hypothalamus 
(centered just lateral to the fornix). 
According to Morgane, recovery 1$ 
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the rule following mid-lateral lesions. 
But recovery is the exception, if not 
an impossibility, after far-lateral le- 
sions. He finds it difficult to main- 
tain the far-lateral animal alive on a 
tube-feeding regime that is adequate 
for the mid-lateral animal. He sug- 
gests that the far-lateral region of the 
hypothalamus contains pallidofugal 
structures (ansa and fasciculus lentic- 
ularis) that are essential for the basic 
feeding reflexes. Lesions here should 
therefore produce permanent aphagia. 
In addition, the failure of the far- 
lateral animal to survive on tube-feed- 
ings leads Morgane to attibute its 
rapid decline to a form of “metabolic 
failure.” 

In our own experience, as described 
in this paper, many of the animals 
that have shown the typical stages of 
recovery were found to have extensive 
damage in the far-lateral hypothalamus 
(see Figure 4, especially Section A). 
Clearly these animals have very severe 
and prolonged deficits. In this we 
agree with Morgane. However, they 
do feed again, although they may Te- 
main adipsic. In the early stages a 
their recovery, they can be maintained 
on tube feeding if cared for assidu- 
ously, The more rapid decline in 
Weight seen in similar animals and at- 
tributed to metabolic failure by Mor- 
Sane can not, therefore, be due to a 
inability to utilize food but may e 
related to changes in activity, tem- 
Perature regulation, Or metabolic rate. 
Stevenson? has shown that lesions 
Within the lateral hypothalamus may 
Produce an increase in metabolic = 
and has suggested that this pae 
energy expenditure may a the 
the rapid postoperative decline © 
far-lateral animal. In addition, NA 
must re-emphasize that lateral hy s 

alamic lesions produce deficits 
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both feeding and drinking. Prolonged 
deficits then may simply represent the 
summated effects of extensive destruc- 
tion of both systems rather than selec- 
tive involvement of “more basic” and 
functionally distinct portions of the 
feeding system alone. 


SUMMARY AND CONCLUSIONS 


Although we have made interpreta- 
tions throughout this paper as new 
phenomena were described, there are 
several generalizations from this work 
that deserve emphasis. Each of them 
has changed our thinking about the 
lateral hypothalamic control of feed- 
ing and drinking. 

We have confirmed the idea that 
there are tissues in the lateral hypo- 
thalamus that are concerned with both 
feeding and drinking. Aphagia is al- 
ways accompanied by adipsia. We 
have never seen pure aphagia—i.e., 
absence of feeding where drinking is 
unimpaired. Any impairment in feed- 
ing then that follows these lesions can 
only be fully understood as the com- 
bined result of deficits in both regula- 
tions. 

Recovery is the rule following these 
lesions, if the animals are maintained 
long enough. Barring accidents, or 
gastrointestinal pathology, it is pos- 
sible to keep these animals alive in- 
definitely. Although we have seen 
prolonged deficits, we have never seen 
an animal with lateral hypothalamic 
lesions that did not show some re- 
covery. 
endind recovers before drinking. 
The quality of the diet is crucial for 
this observation. At first the animal 
will eat only wet and palatable foods. 
But if it is offered nothing except dry 
pellets and water, it will not eat until 
after it has begun to drink, and this 
relationship will be misunderstood. It 
appears to Us, therefore, that lateral 
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hypothalamic lesions impair feeding 
less severely than drinking. 

Eating does not necessarily imply 
regulation. When the animals first 
begin to eat they are anorexic. They 
eat only highly palatable foods, and 
do not eat enough to regulate caloric 
intake or body weight. Their feeding 
seems to be evoked by the taste and 
smell of the food. They are drawn 
to it by appetite, not driven to it by 
hunger. 

Hunger returns, often abruptly. A 
new control reappears, enabling the 
animal to regulate its caloric intake, 
The central feeding mechanism is now 
responsive to some variable associated 
with caloric need. If this variable 
were identified, we would know one 
control system that is sufficient for 
caloric regulation in the normal rat. 

Even after the return of regulatory 
feeding, adipsia persists. It can mask 
caloric regulation if only dry food and 
water are available. The animal will 
not drink water and aphagia will be 
the result of dehydration. If the ani- 
mal is induced to eat saccharin solu- 
tions, it hydrates itself involuntarily 
and then eats dry pellets. Thus, 
adipsia can occur without aphagia. 

Adipsia can be permanent. Some 
animals never drink water again, 
though they live out their natural 
lives. In our experience, aphagia is 
always transient. We have never seen 
permanent aphagia after lateral hypo- 
thalamic lesions. This confirms our 
belief that the central neural tissue 
regulating thirst is more completely 
impaired by these lesions than is the 
tissue controlling feeding. 

Drinking frequently reappears. Ini- 
tially the animals drink only while 
they eat. Just as anorexia is feeding 
without starvation so 
prandial drinking is drinking without 
response to dehydration. Here again, 


response to 


ingestion does not necessarily imply 
regulation. Like hunger, thirst may 
eventually reappear. The central 
thirst mechanism is now responsive 
to dehydration. 

Although regulation of food and 
water intake has recovered, motiva- 
tion is still impaired. The recovered 
animal is greatly affected by the taste 
of both its food and water. Addition 
of small amounts of quinine deters it 
from eating and drinking. Wherever 
taste is excessively important, motiva- 
tion is impaired. 

Finally, in the lateral hypothalamic 
syndrome, the initial transition from 
aphagia to anorexia and from adipsia 
to prandial drinking seems to depend 
upon recovery of the influence of pe- 
ripheral sensations (taste and smell 
of food, or dry mouth), The next 
transition involves the recovery of 
central neural controls permitting not 
only ingestion but ingestion with reg- 
ulation. Anorexia gives way to hun- 
ger and prandial drinking to true 
thirst. This supports the commonly 
accepted view that feeding and drink- 
ing are both under multiple control 
(Brobeck, 1960; Stellar, 1954). Be- 
cause all controls may operate simul- 
taneously in the normal animal, it has 
been difficult to identify those essential 
for regulation. In the lateral hypo- 
thalamic syndrome, the reappearance 
of these controls is separated in time 
during recovery. They are not all 
operating simultaneously, Controls that 
operate in the later stages of recovery 
are absent in earlier stages. It should 
be possible to identify both the con- 
trols that are present and those that 
are absent during any particular stage 
by subjecting the animal in that stage 
lo the variables known to influence 
feeding and drinking in the normal 
animal. If we find controls that are 
absent before regulation, and which 
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reappear with it, we may then be able 
to specify the controls necessary for 
regulation. 


REFERENCES 


ApoLpH, E. F. Ontogeny of physiological 
regulations in the rat. Quart. Rev. Biol., 
1957, 32, 89-137. 

Avotpn, E. F., Barker, J. P., & Hoy, P. A. 
Multiple factors in thirst. Amer. J. 
Physiol., 1954, 178, 538-562. 

Ananp, B. K. & Bnoneck, J. R. Hypo- 
thalamic control of food intake. Yale J. 
Biol. Med., 1951, 24, 123-140. 

Axaup, B. K., Dua, S, & SCHOENBERG, K. 
Hypothalamic control of food intake in 
cats and monkeys. J. Physiol., 1955, 127, 
143-152. 

Anpersson, B, & McCann, S. M. Drink- 
ing, antidiuresis and milk ejection from 
electrical stimulation within the hypo- 
thalamus of the goat. Acta physiol. 
Scand., 1955-56, 35, 191-201. 

Anpersson, B., & McCann, S. M. The 
effect of hypothalamic lesions on the water 
intake of the dog. Acta physiol. Scand., 
1956, 35, 312-320. 

Broseck, J. R. Regulation of feeding and 
drinking. In J. Field (Ed.), Handbook 
of physiology. Scct. 2, Vol. 2. Baltimore: 
Williams & Wilkins, 1960. 

Brosecx, J. R., TEPPERMAN, Js 
C. N. H. Experimental hy 
hyperphagia in the albino rat. 
Biol. Med., 1943, 15, 831-853. 

Bnüccem, M.  Fresstrieb als  hypotha- 
lamisches Symptom. Helv. physiol. phar- 
macol, Acta, 1943, 1, 183-198. 

Carper, J. W, & PoLLrAmD, F. A com- 


parison of the intake of glucose and m 
under conditions © 


& Lone, 
pothalamic 
Yale J 


charin solutions 
caloric need. Amer. J. Psychol., 1953, 
66, 479-482. ‘ai 


Reciprocal changes 
feeding behavior produced by intrahypo- 
thalamic chemical injections. Amer. 
Physiol, 1960, 199, 969-974. (a) 

Erster, A, N. Water intake without nip 
act of drinking. Science, 1960, 131, 497- 
498. (b) 

Epstern, A. N., & TerrerpauM, P. Re 
ulation of food intake in the absence » 
taste, smell, and other ML MW 
sensations, J. comp. physiol. Psychol., 


in press. P. iue 
" 

Fugxcu, J. D. Porter, R. Wi B 

Ameroncen, F. K, & Raney, R. B. 


Epstern, A. N. 


Gastrointestinal hemorrhage and ulcera- 
tion associated with intracranial lesions. 
Surgery, 1952, 32, 395-407. 

Greer, M. A. Suggestive evidence of a 
primary “drinking center” in hypothal- 
amus of the rat. Proc. Soc. Exp. Biol. 
Med., 1955, 89, 59-62. 

GRossMAN, S. P. Eating or drinking 
elicited by direct adrenergic or cholinergic 
stimulation of hypothalamus. Science, 
1960, 132, 301-302. 

Larsson, S. On the hypothalamic organ- 
ization of the nervous mechanism regulat- 
ing food intake. Acta physiol. Scand., 
1954, 32, Suppl. No. 115, 7-40. 

Mutter, N. E. Experiments on motivation. 
Science, 1957, 126, 1271-1278. 

Mitter, N. E., Bairev, C. J., & STEVENSON, 
J. A. F. Decreased "hunger" but in- 
creased food intake resulting from hypo- 
thalamic lesions. Science, 1950, 112, 256- 
259. 

Montemurno, D. G., & Stevenson, J. A. F. 
Adipsia produced by hypothalamic lesions 
in the rat. Canad. J. Biochem. Physiol., 
1957, 35, 31-37. 

MoncANE, P. J. Alterations in feeding and 
drinking behavior of rats with lesions in 
globi pallidi. Amer. J. Physiol, 1961, 
201, 420-428. (a) 

MoncaxE, P. J. Medial forebrain bundle 
and “feeding centers” of the hypothalamus. 
J. comp. Neurol, 1961, 117, 1-26, (b) 

Morrison, S. D., & Mayer, J. Adipsia and 
aphagia in rats after lateral subthalamic 
lesions. Amer. J. Physiol., 1957, 191, 
248-254. 

Sw, O. A. Stimulation of lateral and 
medial hypothalamus and food intake in 
the rat. Anat. Rec., 1956, 124, 363-364. 

SrELLAR, E. The physiology of motivation. 
Psychol. Rev., 1954, 61, 5-22. 

Sreviar, E, & Hut, J. H. The rat's rate 
of drinking as a function of water depriva- 
tion. J. comp. physiol. Psychol., 1952, 
45, 96-102. 

SrELLAR, E., & KRAUSE, N. P. New stereo- 
taxic instrument for use with the rat. 
Science, 1954, 120, 664-666. 

Srevenson, J. A. F, & Rrxon, R. H. 
Environmental temperature and depriva- 
tion of food and water on the spontaneous 
activity of rats. Yale J. Biol. Med. 
1957, 29, 575-584. 

TerrerBauM, P. Sensory control of hypo- 
thalamic hyperphagia. J. comp. physiol. 
Psychol., 1955, 48, 156-163. 

TEITELBAUM, P. Random and food-directed 
activity in hyperphagic and normal rats. 


90 PuiLiP TErTELBAUM AND ALAN N. EPSTEIN 


J. comp. physiol. Psychol., 1957, 50, 486- 
490. 

TEITELBAUM, P., & SrELLam, E. Recovery 
from the failure to eat produced by hypo- 
thalamic lesions. Science, 1954, 120, 894- 
895. 

WnuEATLEY, M. D. The hypothalamus and 
affective behavior in cats. Arch. Neurol. 
Psychiat., Chicago, 1944, 52, 296-316. 


WirLraMs, D. R., & TErTELBAUM, P. Some 
observations on the starvation resulting 
from lateral hypothalamic lesions. J. 
comp. physiol. Psychol., 1959, 52, 458-465. 

Wyrwicka, W., & Dosrzecxa, C. Relation- 
ship between feeding and satiation centers 
of the hypothalamus. Science, 1960, 132, 
805-806. 


(Early publication received August 14, 1961) 


Psychological Review 
1962, Vol. 69, No. 2, 91-108 


A STATISTICAL THEORY OF DYNAMIC 
CONTOUR PERCEPTION * 


W. M. SMITH 
Dartmouth College 


Bartley’s (1941) statement made al- 
most twenty years ago, “. . . that the 
matter of contour formation is one 
of the central phenomena of sensory 
psychology and nerve physiology" (p. 
334), certainly is as true today as it 
was when he wrote it. Indeed, the 
way in which borders are perceived 
is a fundamental problem in visual 
science. 

It is very clear that the classical 
idea of the retinal mosaic as a system 
of insulated receptors cannot meet 
adequately various facts of visual acu- 
ity, brightness discrimination, and 
contour definition, Visual acuity is 
greater than the classical notion im- 
plies, and contour perception is not 
dependent upon two parallel arrays of 
receptors, one array stimulated, the 
other not. Inadequate consideration 
of postretinal structures has been the 
major failing of what is now consid- 
ered to be the classical view of visual 
acuity, 

The work of Marshall and Talbot 
(1942) is a formulation of a theory of 
visual acuity and contour resolution 
in terms of the structure and the func- 
tion of retinal and postretinal neural 
elements. Drawing upon recent find- 
ings and concepts of neurophysiology, 
they view such basic processes of acu- 
ity and contour perception as agent 
ent upon the interrelated activity © 
populations of receptors and neural 
elements. Appropriately, their formu- 
lation now is referred to as 4 statis- 
tical theory of visual function. 

Although we shall not present a re- 
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view of the Marshall and Talbot 
work, for expository purposes later it 
will be helpful to mention some gen- 
eral features of their theory here. 
Their analysis rests heavily upon the 
neurological concepts of reciprocal 
overlap, lateral and vertical summa- 
tion in the visual system, physiological 
nystagmus, and the nature of the neu- 
ral recovery cycle. Especially impor- 
tant is the fact that their theory places 
the locus of the critical aspects of con- 
tour perception in the visual cortex 
rather than at the retinal level. 

In their original work Marshall and 
Talbot were concerned primarily with 
the general question of sensory acuity, 
but in more recent years other in- 
vestigations have extended the statis- 
tical theory to other aspects of vision. 
Osgood and Heyer (1952), for ex- 
ample, have applied it to the analysis 
and interpretation of figural afteref- 
fects and have claimed the statistical 
theory to be superior to the field con- 
ception offered by Köhler and Wal- 
lach (1944). Regarding figural after- 
effects, however, both the field and the 
statistical interpretations have had 
their critics (Deutch, 1956; Smith, 
1952). More recently, Day (1956, 
1957) has used statistical theory in 
the interpretation of phenomenological 
reports and experimental data con- 
cerning form perception. In particu- 
Jar, he has employed statistical theory 
in considering the question of the 
emergence of form in the peripheral 
retina and in interpreting data from 
studies of area and area-perimeter 
ratios in form perception. Finally, 
Osgood (1953, pp. 245-47) has pre- 
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sented a brief account of how statis- 
tical theory can be applied to the 
problem of apparent movement. 

It is with the perception of move- 
ment that the present paper is con- 
cerned. We shall attempt to extend 
statistical theory to a treatment of 
perceived movement. First, we shall 
describe briefly the basic experimental 
situation. Then we shall summarize 
the results from which the theoretical 
treatment was developed. Finally, we 
shall present an account of our modi- 
fication and extension of statistical 
theory with supporting evidence. 


DYNAMIC CONTOUR PERCEPTION 

For present purposes we define “dy- 
namic contour perception” as the per- 
ception of the borders of moving stim- 
uli. The definition could be modified 
to include instances wherein the ob- 
server moves rather than the stimulus. 
We shall not consider this modifica- 
tion here, 

In two previous papers (Smith & 
Gulick, 1956, 1957) data have shown 
that the perception of the borders of 
moving stimuli is influenced not only 
by the velocity of the stimulus, but also 
by the length of time the stimulus is 
observed in a fixed position prior to 
movement. The results of these ex- 
periments indicated that the border of 
a small square could not be seen 
clearly at velocities exceeding about 
13 degrees of visual angle per second. 
However, when the stimulus was pre- 
sented in a fixed position prior to 
movement, the maximum velocity at 
which the edges could be perceived 
was increased by a factor or two or 
more in most instances. A systematic 
study of the combinations of velocity 
and initial exposures needed to main- 
tain contour revealed a linear relation- 
ship up to exposures of about 350 
milliseconds and velocities of about 30 

degrees of visual angle per second. 


For every increase in velocity that pro- 
duced loss of contour, an increment in 
the duration of the initial exposure 
could be utilized to regain it. In more 
recent unpublished studies in which 
the duration of exposure of the initial 
stationary phase of the stimulus has 
been extended to 1 second, we find 
that the ability to see an edge with 
increasing velocities of movement is 
related curvilinearly to the duration 
of the initial stationary phase. At 
velocities exceeding about 40 degrees 
per second it appears that the fixed 
phase of presentation, regardless of its 
duration, will not enable one to see 
a moving edge clearly. In an earlier 
paper (Smith & Gulick, 1957) we of- 
fered a preliminary statement of sta- 
tistical theory as applied to the com- 
bined effects of the initial exposure of 
the stimulus in the fixed position and 
the movement phase, In the present 
paper we wish to expand this pre- 
liminary theoretical treatment. 


HISTORICAL BACKGROUND OF THE 
MonEL 


When an observer's total visual 
field consists of a light-dark pattern 
divided vertically, there is no apparent 
gradient of gray where light meets 
dark. The discontinuity is clearly de- 
fined. How the visual system is able 
to "construct" an edge from such a 
distribution of light energy striking 
Visual receptors is not well under- 
Stood. However, it is known that the 
abrupt gradient of reflected light 
Which exists at the border of a stimu- 
lus object is not present once the 
pattern of light reaches the retina 
(Fry, 1955). "The abrupt gradient be- 
comes less abrupt so that the intensity 
of light comprising the geometric 
image of a border at the retina ap- 
proximates the integral of a normal 
distribution. It is this pattern of light 
energy which courses back and forth 
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the distribution of light of this pattern 
on the retina approximates à normal 
curve. As the angular width of such 
à pattern increased, the symmetrical 
distribution at the retina would be 
changed progressively, becoming flat- 
tened on top. Each border, however, 
Would be represented on the retina by 
à distribution of light similar to that 
lor the discontinuity of the vertically 
divided field mentioned previously. 
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«citation. within the afferent system, c. At 
tion is peaked further and is the direct 


of a very complicated series of events 
and mechanisms intervening between 
cornea and cortex. The abrupt change 
energy level in the stimulus is not 
at the retina because of 
distortion introduced by passage 
through ocular media. Slight irreg- 
ularities of the surface of the eye, 
errors of refraction and of aberration, 
and entopic stray light all tend to re- 
duce the abruptness of the stimulus 
it is projected on the 


of 
reproduced 


gradient as 


retina. 


In Figure 1 we have attempted to 


show schematically the transformation 
from stimulus gradient to cortical dis- 
tribution. A line in the visual field 
presents an abrupt gradient of energy. 
‘At the retina the distribution of energy 
assumes the platykurtic form as shown 
jn Figure 1b. The relative flatness of 
this distribution is due to the scatter- 
ing of light by the distorting factors 
ted previously, plus the effects of 
al nystagmus. This dis- 
f light represents the prox- 
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imal stimulus which triggers the pri- 
mary receptors. If one plots the rate 
of neural firing for different retinal 
loci, then a distribution of excitation 
results. This hypothetical distribution 
is shown in Figure lc. The peaking 
of this distribution is produced by the 
combined effects of numerous mecha- 
nisms. First, neural elements located 
near the center of the distribution in 
Figure 1b are stimulated by the most 
intense light. Accordingly, the num- 
ber of fibers stimulated as well as the 
average rate of firing of these neurons 
is greater than in those neural elements 
located near the edges of the distri- 
bution. Second, as a result of physio- 
logical nystagmus, centrally located re- 
ceptors are stimulated more efficiently 
because the rate of application of the 
light energy is maximized in those 
receptors located near the maximum 
slope of the distribution of light 
(Rushton, 1935). If we assume the 
primary “carrier” of contour activity 
to be the on-off fibers, then these fibers 
serving receptors centrally located 
will have a higher rate of firing than 
those serving receptors found at the 
periphery of the population of recep- 
tors involved. Third, because of the 
faster rate of application of stimula- 
tion just noted, the amount of neural 
accommodation in these centermost 
fibers will be minimized. Neural ac- 
commodation, according to Nernst 
(1908), is a counter-process of some 
kind in the neuron which cuts short 
the effect of a stimulus. As the rate 
of application of a stimulus is in- 
creased, the amount of accommoda- 
tion is decreased (Monnier, 1934). 
Whether neural accommodation ac- 
tually operates in the vertebrate retina 
remains to be demonstrated conclu- 
sively, but there is some evidence to 
suggest that it does (Granit & Skog- 
lund, 1943). The result of the in- 
creased rate of application of the stim- 
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3 
ulus, and hence decreased accommo- 
dation, produces an additional relative 
increase in excitation rate for the neu- 
rons associated with the center of ac- 
tivity in the retina (Skoglund, 1942). 

There are, in addition, possible bases 
for peaking the excitation through the 
action of retinal receptive fields (Bar- 
low, 1953a, 1953b; Hartline, 1940; 
Kuffler, 1953). A receptive field of 
a particular ganglion cell typically is 
organized with antagonistic discharge 
patterns of the center and the periph- 
ery, that is, the center may be pre- 
dominantly "on" and the periphery 
of the field “off,” or vice versa. The 
intermediate area of the field will be 
“on-off.” Thus, any displacement of 
a border across such a field as might 
be produced by slight movements of 
the eye will result in continuous ex- 
citation of the ganglion cell. If the 
field is an ‘off-center” type and the 
gradient of stimulation passes from 
center to periphery then there wiil be 
bursts of excitation on leaving center, 
entering and leaving the intermediate 
area, and upon entering the peripheral 
part of the field. Any kind of scanning 
back and forth across this field ob- 
viously will maximize excitation of 
the ganglion cell associated with the 
receptive field. If to these complex 
events occurring at the retinal level 
we add the effects of reciprocal over- 
lap Occurring during projection, the 
distribution of excitation in the visual 
cortex is peaked further and we have 
the leptokurtic distribution as shown 
in Figure 1d, 

The phenomenon. In our early ex- 
periments, a stimulus (4) was ob- 
served moving horizontally through 
distance (D), from left to right. Its 
velocity (V) was increased until the 
edges of 4 no longer could be seen 
clearly. At this velocity, if 4 were 
exposed briefly in a fixed position prior 
to movement (this duration we called 
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t), A was seen with sharp edges. 
This effect was systematically inves- 
tigated for velocities from 0 to approx- 
imately 30 degrees of visual angle per 
second. When values of V were 
plotted against values of t, a linear 
relationship was obtained. © In order 
to maintain the contour of a moving 
stimulus, each increase in V of 1 de- 
gree visual angle per second required 
a corresponding increase in t of ap- 
proximately 25 milliseconds. 

The addition of the prior stationary 
phase to movement definitely facili- 
tated the perception of the edges of the 
moving stimulus. Stemming from a 
statistical treatment of apparent move- 
ment (Osgood, 1953, pp. 245-47 ), 
wherein two separate distributions of 
excitation in the cortex sum into one 
distribution whose peak is displaced 
in cortical space, a tentative explana- 
tion of our contour data was offered. 
The hypothesis was that excitation 
arising from the moving stimulus 
summed with the decaying activity 
around c, (the cortical locus of max- 
imum activity produced by the fixed 
stimulus) to generate a single peaked 
distribution moving through cortical 
space toward locus c, (the cortical 
locus of maximum activity produced 
by the moving stimulus at the termina- 
tion of movement). 


Tue MODEL 


ate to state some 


It is now appropri so 
our statistical 


assumptions upon which 5 4 
Concept of dynamic contour is based. 
Some of these assumptions follow 
closely those made by other statistical 
theorists; others are especially ger- 
mane to the question of dynamic con- 
tour, 


1. On-off and off rece 
Principal retinal mediators 
Perception, The excitation 
from the stimulation of these t 


ptors are the 
s of contour 
ensuing 
wo types 


of receptors becomes the cortical rep- 
resentation of a border. Because of 
their response characteristics it is as- 
sumed that on fibers play a minor role. 

2. Each of an infinite number of 
points along a border of a stimulus 
(hairline) is represented in the visual 
cortex by a normal or near-normal 
distribution of excitation (rate of firing 
of neural elements extended over cor- 
tical space). 

3. The continuum of points along 
the border of the stimulus is repre- 
sented in the cortex by a continuum 
of points of maximal excitation, the 
longitudinal extent of which is propor- 
tional to the length of the border. 

4. The mean slope of the cortical 
excitation distribution, rather than the 
absolute amount of excitation, defines 
the clarity of an edge. The sharpness 
oi the edge is determined directly by 
the relationship between the amount of 
excitation at its maximum and the 
amount of dispersion of excitation 
through cortical space. It is assumed 
that the mean slope of the distribution 
of excitation increases through time 
in a negatively accelerated fashion. 
Only when the slope equals or exceeds 
some critical value range is contour 
perceived. This implies some finite 
period of time as a necessary antece- 
dent to contour perception. 

The slope of the excitation distribu- 
tion in the cortex at any instant in time 
is determined by the intensity of light, 
contrast, and by the state of the eye 
and proximal neural elements. 

5. Overlapping distributions of ex- 
citation in the cortex summate. 

6. The mean slope of the distribution 
of excitation. also is influenced by 
velocity of movement : slope is in- 
versely related to velocity. 

7. After cessation of stimulation 
arising from a fixed or moving border, 
the mean slope of the excitation dis- 
tribution decreases through time in a 


96 W. M. Smitu and W. I. GULICK 


1.00 


075 


0.50 


025 


Mean slope of distribution of excitation 


[9] 25 50 75 


100 125 150 175 


Time (msec) 
Fic. 2. Hypothetical curve depicting the increase in the mean slope of the excitation 


distribution through time. 


conditions. 
stimulation.) 


negatively accelerated fashion. The 
mean slope at any given instant in time 
during decay is influenced by the in- 
tensity of the light: the higher the 
intensity, the steeper the slope. 


When light stimulates the retina, 
the level of cortical excitation related 
to it grows in a negatively accelerated 
manner. Moreover, and of more im- 
portance, the excitation distribution 
representing a contour increases in 
mean slope through time in a nega- 
tively accelerated fashion. This in- 
crease in slope is shown in Figure 2. 

This hypothetical function indicates 
that in approximately 50 milliseconds 
the slope (S) reaches a value neces- 
sary for sharp contour perception 
(Se). Although this temporal value is 
to some extent arbitrary, it illustrates 
here that the appearance of sharp 
edges in the visual field takes a finite 


(S.—critical slope—signifies the value necessary for the percep- 
tion of contour. The slope of the function in the fi 


In this example, contour is perceived 


gure would change with various stimulus 
about 50 milliseconds after initiation of 


period of time to develop. There is 
no single instant when a sharp edge 
appears. Rather, the transition from 
“fuzziness” to a clearly defined edge 
is a continuous matter. The value of 
50 milliseconds in Figure 2 represents 
a general threshold estimate for the 
appearance of a well-defined edge. 
Numerous factors can influence the 
rapidity with which S, is attained. 
among which are intensity level, de- 
gree of contrast, and level of retinal 
adaptation. Regardless of specific 
conditions of observation, however, the 
delay in reaching S, after the onset 
of stimulation is due to the temporally 
dispersing factors of synaptic delay, 
Teverberation, and horizontal spread 
of neural activity as it proceeds from 
the receptors to the visual cortex. 
When a border is removed from 
the visual field, its related excitation 
distribution decays with negative ac- 
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celeration. Furthermore, the slope of 
this distribution also decreases and in 
the same general manner (Assump- 
tion 7). Figure 3 represents a hypo- 
thetical decay function showing the de- 
crease in mean slope of the excitation 
distribution as a function of time. 
Note that this decay function is influ- 
enced to an important degree by the 
duration of the stimulus and its in- 
tensity. The prolonged decline in the 
mean slope is due to the continued ex- 
citation as a result of the reluctance 
of the on-off and off fibers in the 
Optic nerve to cease activity. The 
nature of this decay is based primarily 
upon evidence obtained by Hartline 
(1938) of residual activity persisting 
lor many seconds, although gradually 
subsiding. Because the slope during 
decay exceeds S, for some brief but 
finite period of time after the cessation 
of stimulation, an edge can be per- 
ceived during this brief interval. 
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The hypothetical functions depicted 
in Figures 2 and 3 represent the in- 
crease in mean slope as a function of 
time produced by an initial stationary 
stimulus (Figure 2) and the decrease 
in mean slope as a function of time 
produced by the cessation of the same 
stimulus (Figure 3). When a moving 
stimulus is presented to the eye, both 
the mean slope of excitation and the 
level of excitation increase in the same 
general manner as that already dis- 
cussed for stationary stimuli. In Fig- 
ure 4 are shown three hypothetical 
functions describing the mean slope, 
one for each of three velocities of 
movement. Assuming other condi- 
tions constant, they illustrate that the 
time required to attain S, is deter- 
mined by velocity of movement. In 
Assumption 6 it was stated that the 
mean slope of the distribution of ex- 
citation produced by a stimulus in mo- 
tion was affected by the velocity of 
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. Fic. 3. Hypothetical curve of decay in 
time. (In this example, contour is peri 
Stimulation until the slope of the excitation 
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ceived for a few milliseconds after cessation of 


distribution falls below Se.) 
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Fic. 4. Three th 
excitation distribution 
of movement increases, 


movement. At low velocities (V < 
10°/sec) the time required to attain 
Se is approximately equal to that re- 
quired to attain it with a stationary 
stimulus (Curve a). 
higher velocities Fi 
time is required before S, is attained 
(Curve b). Indeed, if V is sufficiently 
high, Se will never be reached (Curve 
c). As illustrated with these hypo- 
thetical functions, when velocities are 
very high (Curve c) the mean slope of 
the excitation distribution never 
reaches the critical value, S.. There- 
fore, clearly defined edges cannot be 
perceived. It is under these condi- 
tions that the presence of the initial 
stationary phase of the stimulation se- 
quence becomes useful. : 
What is now required is to elucidate 
the manner in which the excitation 
arising from the stationary stimulus 
sums with the excitation produced by 


However, at 
0°/sec) more 


coretical curves showing how 
changes over time for different 


more time is required for the mean slope to reach 
necessary for contour perception. At high velocities 


the increase in mean slope of the 
velocities of movement. (As velocity 
Se, the level 
Se may not be attained—Curve c.) 


the subsequent moving stimulus so 
as to maintain jointly a value of Se 
Which is sufficient for contour percep- 
tion. It is our hypothesis that the 
maintenance of contour during stim- 
ulus movements as a result of the 
presentation of the initial stationary 
phase is achieved by the summation 
of the distributions of excitation aris- 
ing from the tota] stimulation se- 
quence. Excitation resulting from the 
moving stimulus is added to the resid- 
ual remaining after the withdrawal of 
the stationary Stimulus, and their sum 
under certain conditions can be suf- 
ficient to increase the mean slope of 
the combined excitation distribution 
to equal or surpass S, at a time much 
earlier than would otherwise have ob- 
tained had S, been attained solely 0? 
the basis of excitation from movement- 

Even under conditions of static con- 
tour perception a brief interval is re- 
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Fic. 5. Theoretical curves showing how the combined distributions of excitation m 
the moving contour (regular broken lines) and residual and decaying excitation est 
initial fixed exposure of the stimulus (dot-dash lines) combine (solid erem over E 
to produce contour perception. (As time increases from the moment 9 the ep E. 
Movement to 20 milliseconds, a; 40 milliseconds, b; 60 milliseconds, c; sto, the level o 
excitation related to the moving border increases as it moves through cortical space. The 
level of excitation of the combined distribution also increases through time. Here it may 
be seen that the slope of the distribution representing the moving stimulus, Sm, increases 
through time, The slope of the combined distribution, Se, increases through time at a 
faster rate, and thereby hastens the attainment of the critical slope, Se, for contour resolution. 
. The symbol C, represents the cortical location representing the peak of excitation from 
the stationary phase, and C» the cortical location of the peak of excitation from the moving 
Phase at the end of 'movement. The abscissa represents cortical space in arbitrary units.) 
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quired before S. is reached because of 
the temporally dispersing factors al- 
ready mentioned. This interval prob- 
ably increases slightly when a stimulus 
moves slowly; but when velocity ex- 
ceeds 10 degrees or so per second, the 
interval increases markedly. Within 
the range of velocities for which the 
initial fixed stimulus is required in 
order to maintain contour, larger and 
larger residuals of excitation must be 
added to the excitation produced by 
movement if contour is to be present. 
The result of this summation is to 
increase the slope of the combined ex- 
citation distribution, and hence hasten 
the attainment of Se. 

This summation is illustrated for 
increasing temporal intervals in Figure 
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Fic. 6. When the velocity of movement 
of a stimulus is constant, the rate of dis- 
placement of the peak of excitation in the 
cortex is also constant (solid line). How- 
ever, when the decaying activity from the 
fixed stimulus sums with the growth of 
activity produced by the moving stimulus, 
the peak of the combined distribution is not 
displaced through cortical space at a con- 
stant rate (dashed line). 
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5. Note in Figure 5 that the excita- 
tion distribution for the fixed stimulus 
(dot-dash line) decays through time, 
thereby producing a distribution of 
progressively less slope. Note also 
that the slope of the excitation distribu- 
tion representing the moving stimulus 
(dashed line) increases through time, 
as does the combined distribution 
(solid line). The most important 
point illustrated by the set of curves, 
however, is that the mean slope of 
the combined distribution increases at 
a faster rate than does the mean slope 
of the excitation distribution for move- 
ment. This means that the perception 
of the edge of the moving stimulus is 
facilitated and can be seen at an earlier 
point in time than would be the case 
had there not been the contribution to 
excitation by the decay of the stimula- 
tion from the fixed stimulus, 

Figure 5 demonstrates another im- 
portant aspect of this summation proc- 
ess. According to the Marshall and 
Talbot (1942) view, borders of a stim- 
ulus are represented by the peaks of 
distributions standing for points along 
the borders or in their terms, a ridge 
of excitation is produced by an infinite 
number of points along the edge of a 
stimulus. For static contour percep- 
tion the location of the stimulus edge 
in the visual field and its cortical loca- 
tion bear a rather straightforward re- 
lationship. In dynamic contour per- 
ception, however, this relationship be- 
comes more complex. If we assume 
the eye to be fixed, when the velocity 
of movement of the stimulus object is 
constant, so is the velocity of the mov- 
mg peak of excitation as it travels 
through cortical space. In dynamic 
contour perception based upon the 
Summation of excitation distributions 
the peak of total excitation does not 
travel through cortical space at a con- 
stant velocity. In Figure 5d the peak 
of the combined distribution can be 
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seen to lag behind the peak produced 
by the moving border alone (by about 
one unit of cortical space). The dif- 
ference in the rates of displacement 
for the peak of the distribution of 
excitation produced by the moving 
border and the peak of the summed 
distribution of excitation are presented 
in Figure 6. This figure illustrates 
that the rate of displacement of the 
peak of excitation for the moving stim- 
ulus alone is constant; hence the 
Straight line. However, the rate of 
displacement of the peak of excitation 
for the combined distribution is not 
constant, but first negatively accele- 
Tated and then positively accelerated. 
It is hypothesized that this change in 
Tate of displacement of the peak of 
the distribution of the summed ex- 
Citations is responsible for a peculiar 
effect reported by our subjects; 
namely, that the velocity of the moving 
edge did not appear to be constant. 
is account of perceived variable 
Velocity with constant velocity of 
Movement we believe to be one im- 
Portant piece of supporting evidence 
or the theory presented here. 
© summarize: For low velocities 
9f movement Se is attained with suf- 
lent speed to allow virtually instan- 
aneous contour perception. With 
Velocities Se is never attained 
mor the time interval required to 
™Mplete the stimulus movement se- 
mes, For the intermediate range of 
a Ocities (8-30?/sec), Se is attained 
oe enough to permit contour per- 
ade only if the decaying neural ac- 
iis Y produced by the fixed stimulus 
“Ms with the growth of neural ac- 


tiv x : 
'Vity produced by the moving stim- 
ulus, 


Supportinc EVIDENCE 


gebe of techniques for specific meas- 
^: of the integrative activity of the 
ual system makes difficult any di- 
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rect test of statistical theory as ex- 
pressed here. However, indirect evi- 
dence of a supporting nature can be 
obtained from various perceptual ex- 
periments. We have mentioned pre- 
viously the evidence concerning the 
perceived variable velocity of a con- 
stant velocity stimulus. To other sup- 
porting evidence we now turn. 

Contrast. We have said that con- 
tour perception cannot occur unless 
the slope of the cortical distribution 
representing contour reaches some 
minimal value (Se). One factor 
which should have a direct and im- 
portant bearing on the activity of on- 
off fibers, and hence the peaking of 
the related cortical distribution of ex- 
citation, is the intensity difference rep- 
resenting a border. It was hypoth- 
esized, therefore, that under conditions 
of high contrast between a stimulus 
object and its surround, the perception 
of a moving edge could occur at veloci- 
ties of movement higher than those 
which would be expected under low 
contrast. Because high contrast would 
decrease the time required to obtain 
Se, the importance of the summative 
effects of decaying neural activity fol- 
lowing the presentation of the initial 
stationary stimulus would be mini- 
mized. That is, a stimulus of high 
contrast at a given velocity should re- 
quire less contribution from the de- 
caying effects of the initial stationary 
stimulus than a stimulus of the same 
velocity and lower contrast. The ma- 
jor contribution of the initial stationary 
stimulus is to provide, after its cessa- 
tion, neural excitation which may sum 
with that produced by the moving 
stimulus. This summation effect sim- 
ply hastens the attainment of Se during 
movement (see Figure 5). 

Tn terms of our operations the result 
of an increase in contrast w ould be to 
allow the perception of contour of 
the moving stimulus at a higher 
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Fic. 7. The effects of contrast and illu- 
mination level on contour perception of 
moving stimuli. (Data of five subjects. 
Brightness values in footlamberts of the 
background and the stimulus: d, 4.2-1.2; 
€, 4.2-1.5; b, .036-.007; a, .036-.015. The 
stimulus was always darker than the back- 
ground. Stimulus size equaled 0.5 degrees, 
and the extent of movement was 5.0 degrees 
of visual angle. Each curve in the figure 
specifies how long the stimulus had to ap- 
pear in a fixed position prior to movement, 
t; in order for contour to be perceived dur- 
ing movement of a particular velocity.) 


velocity than under conditions of low 
contrast, even though the duration of 
the fixed stimulus remained constant. 
Conversely, for a given velocity of 
movement, a reduced exposure interval 
of the initial fixed stimulus is adequate 
to maintain contour perception during 
movement whenever contrast is in- 
creased. 

Data on the effects of contrast are 
presented in Figure 7. For each of 
five subjects, velocity of movement 
was increased from 8 degrees per 
second in Z-degree steps until contour 
was lost (about 13?/sec). There- 
after, the value of t, required to see 
contour was determined for each in- 
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crease in V. In Figure 7 mean values 
of t, are plotted against V for each of 
two contrast conditions at each of two 
levels of illumination. These data 
show that increased contrast selectively 
displaces the time-velocity contour 
function in the expected direction: 
higher contrast enables the subject to 
perceive the edge of the moving stim- 
ulus at higher velocities with a con- 
stant value of t,. Under low illumina- 
tion the increase in contrast shifted 
the function from a to b, and under 
high illumination, the increase in con- 
trast shifted the function from c to d. 

Intensity level. The data plotted in 
Figure 7 also show an effect one 
would predict from statistical theory; 
namely, that contour perception of 
stimuli in motion is influenced by in- 
tensity. In Assumption 7 we stated 
that the mean slope of the decaying 
distribution of excitation became 
steeper as stimulus intensity increased. 
Furthermore, the rise in the mean 
slope of the distribution of excitation 
from the moving stimulus with in- 
creased intensity will be faster and will 
hasten the attainment of Se. Accord- 
ingly, the importance of the summative 
effects of the initial stationary stim- 
ulus would be reduced, 

As seen in Figure 7, the result of 
increasing intensity shifted the time- 
velocity contour function in the ex- 
pected direction; under low contrast, 
increased intensity shifted the function 
from a to c, and under high contrast 
an increase in intensity shifted the 
function from b to d. 

Maintaining a constant contrast be- 
tween the stimulus and its surround 
while increasing intensity should and 
does facilitate contour perception, but 
only up to a point. If adjacent areas 
Of a visual pattern differing in in- 
tensity have their intensities increased 
proportionately, a level will be reached 
where the necessary conditions for sta- 
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tistical peaking will no longer be pos- 
sible. Thus if the areas defining a 
contour are both sufficiently intense, 
on-off receptors would not respond 
and the presumed normal distribution 
of excitation would degenerate. 

Spatial displacement. Our account 
of how decaying neural activity from 
fixed stimulation combines with that 
produced by a moving stimulus re- 
quires particular spatial and temporal 
relationships of these two parts of the 
stimulation sequence. Any change 
which destroys the spatial and tem- 
poral contiguity of the two phases 
should have an adverse effect upon 
Contour formation. 

We predicted that the introduction 
of a spatial separation in the visual 
field between the position of the fixed 
stimulus and the initial position of the 
Moving stimulus would require a 
higher value of t, to allow contour 
Perception during movement than 
Would be required in a control condi- 
tion without spatial separation. The 
rationale for the separation in the 
Visual field of the two phases of the 
Stimulation sequence was to separate 
the assumed distributions of excitation 
Produced by them in the visual cortex. 
It was necessary to provide the sub- 
ject with a minute fixation point lo- 
cated at the place where movement 
began, because without it the subject 
Probably would have tended to view 
the entire stimulus sequence with the 
Same area of the retina. Following the 
Cessation of the stationary stimulus, the 
Moving stimulus appeared at the point 
of fixation and moved from left to 
Tight, 

If the neural elements involved in 
the excitation produced by a stationary 
Stimulus comprised a totally different 
Population of elements from those in- 
volved in excitation from a moving 
Stimulus, then no summation could 
take place. On the other hand, max- 
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imum summation would occur when 
identical elements comprised both pop- 
ulations. It was predicted that the 
greater the spatial displacement (the 
less the summation), the more adverse 
would be the effect upon contour 
formation. 

Data obtained confirmed the spatial 
displacement hypothesis. Figure 8 
presents the time-velocity functions ob- 
tained under several experimental con- 
ditions. These data are from eight 
subjects, and the procedure employed 
in obtaining the data was the same as 
that indicated previously. — (Differ- 
ences in the origin of the time-velocity 
functions which exist from one experi- 
ment to another are assumed to be 
due to individual differences, and dif- 
ferences in illumination and contrast.) 
The control condition consisted of hav- 
ing the stationary stimulus appear at 
the same place in the visual field from 
which movement began.  Displace- 
ments were measured in degrees of 
visual angle at 1 and 2 degrees right, 
and 1 and 2 degrees left. Displace- 
ments of equal extent to the right and 
left produced equivalent shifts in the 
functions and so data were combined 
across this variable. 

Temporal displacement. If there 
were a time lapse between the cessa- 
tion of the initial stationary stimulus 
and the onset of the moving one, then 
subsequent summation or neural ac- 
tivity should be reduced. This reduc- 
tion in summation would increase the 
time required to obtain S, and would 
minimize the influence of the stationary 
stimulus upon the perception of con- 
tour of the moving phase. 

Judgments of contour relating to 
this issue were obtained for eight sub- 
jects under each of three conditions. 
In the control condition the two phases 
of the stimulus sequence were tem- 
porally contiguous, where in the other 
two conditions temporal delays of 40 
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Fic. 8. The effects of spatial separation 
of the two phases of stimulation on dynamic 
contour. (Combined data of eight subjects. 
Stimulus brightness, 20 footlamberts ; 
brightness of background, 70.0 footlamberts. 
Stimulus size, 0.5 degree. Extent of move- 
ment, 5 degrees of visual angle. Increasing 
the spatial separation between fixed phase 
of stimulation and moving phase required 
a greater exposure duration of the fixed 
phase in order to retain contour perception 
for a particular velocity of movement. Dis- 
placements to the right and left produced 
equivalent effects. Therefore, data were 
combined across this parameter.) 


and 80 millseconds were employed. 
We predicted that the temporal delays 
would shift the time-velocity functions 
to the left indicating, that for a par- 
ticular velocity, longer exposures of 
the stationary stimulus would be re- 
quired to produce contour formation. 
The amount of neural activity available 
to sum with that produced by the 
moving stimulus is determined not 
only by temporal displacement of the 
two phases of the stimulus sequence, 
but by the duration of the stationary 
stimulus as well. Accordingly, we 
hoped to obtain data suggesting that 
what is lost to the summative process 
as a result of temporal delay can be re- 
covered by increasing the value of t,. 


Data relevant to this problem are 
presented in Figure 9. The shift in 
the contour function as a result of tem- 
poral displacement is apparent. The 
greater the delay, the larger the shift 
to the left from the control condition. 
Presumably, a delay longer than 80 
milliseconds might produce a time- 
velocity function asymptotic to the 
ordinate, suggesting that beyond a cer- 
tain duration of temporal displacement, 
increased durations of t; would cease 
to influence the perception of contour. 
Increased values of t, can compensate 
for temporal displacement provided 
that the latter is not too long. For 
example, at a velocity of 18 degrees 
per second the duration of the sta- 
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, Fic. 9. The effects of temporal separa- 
tion of the two phases of stimulation on 
dynamic contour. (Combined data of eight 
Subjects. Brightness of stimulus, 2.0 foot- 
lamberts ; brightness of background, 70.0 
footlamberts. Stimulus size, 0.5 degree of 
visual angle. Extent of movement, 5 de- 
grees. For a given velocity of movement, 
the exposure duration of the fixed phase © 
stimulation must be progressively increase 
as temporal separation increases in order 
to retain contour.) 
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tionary stimulus necessary for cọntour 
perception during movement was 80 
milliseconds for the control condition 
(no temporal displacement). How- 
ever, with temporal displacements of 
40 to 80 milliseconds, the durations of 
the fixed phase had to be increased to 
approximately 100 and 120 millisec- 
onds, respectively. 


DISCUSSION 


_ The conception of contour percep- 
tion which we have offered, while con- 
Sruent with certain facts concerning 
the visual system and with particular 
experimental results, has not directed 
Itself to various considerations of a 
complicating and relevant nature. 
these considerations include the ques- 
tion of eye movement, the locus of the 
Summation process, the question of 
Neural inhibition, and some general 
questions about — neurophysiological 
mechanisms. 

Eye movements. The perception of 
edges of moving stimuli obviously 
Taises the problem of how ocular track- 
Ing is related to the formation of con- 
tours, Surprisingly little empirical 
knowledge is available regarding the 
nature of ocular tracking, although 
One of the authors is currently inves- 
tigating this problem. Having avail- 
able a photoelectric system of measur- 
Mg eye movements which is coupled 
9 a system for generating moving 
targets, precise measures can be made 
9f ocular tracking and these then re- 
ated to perceptual judgments of tar- 
Bets whose velocity, displacement, and 
Other attributes can be varied (Smith 

Warter, 1959, 1960). Investiga- 
lons to date indicate quite clearly that 
9T the border of a moving target to 
cu Seen clearly a high. degree of ac- 
Wide in visual tracking is mol re- 
on] ed. In fact, it appears that it is 

Vy for the very slow velocities (con- 
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stant) of movement (about 10?/sec or 
less) that the eye tracks with reason- 
able accuracy. The upper limit of 10 
degrees per second, it is important to 
note, is approximately the velocity of 
movement at which the borders of 
the moving stimulus begin to be seen 
as blurred when the duration of the 
stimulus in a fixed position before 
movement (t,) is zero. It will be re- 
called that the most important single 
fact reported in our contour experi- 
ments is that by increasing the interval 
of exposure of the stimulus in a fixed 
position prior to movement we can 
systematically increase the upper limit 
of velocity at which the edge of the 
stimulus still can be seen clearly. A 
basic question raised by our experi- 
ments concerns how an increase in t, 
affects visual tracking. A provisional 
answer to this question based upon eye 
movement data to date is that increas- 
ing the magnitude of t, improves visual 
tracking generally but not to a level 
of accuracy characteristic of tracking at 
low velocities. About all we can con- 
clude at this point is that the facilita- 
tion of dynamic contour perception by 
exposure of the target in a fixed posi- 
tion prior to movement is in some 
manner directly related to the accuracy 
of visual tracking. We can be sure 
that in our contour experiments the 
displacement of the stimulus at a con- 
stant velocity did not produce on the 
retina a displaced locus of maximum 
excitation which also was displaced 
at a constant velocity. We must as- 
sume that the cortical representation 
of a contour must contain components 
of acceleration and deceleration not 
found in the original stimulus. This, 
of course, is due to irregularities in the 
pursuit movement of the eyes under 
conditions wherein the moving stimu- 
lus alone is presented, and it is due to 
inaccurate tracking and the summation 
process under conditions in which both 
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phases of the stimulation sequence are 
presented (see Figure 6). 

Based upon eye movement data we 

can offer a suggestion concerning the 
possible relation between dynamic con- 
tour perception and correlated visual 
tracking movements. If one examines 
the magnitudes of lag and lead errors 
of the eye as the velocity of a stimulus 
increases, it is obvious that the mag- 
nitude of these errors increases as 
velocity increases, and at any given 
velocity it decreases at t, is increased. 
It is our hypothesis that as long as 
the lag or lead error does not exceed 
a few degree of angle (3—4 degrees at 
most), contour perception can be 
maintained. If this hypothesis is sup- 
ported by the experiments now in 
progress, it will converge nicely with 
the facts about the maximum angular 
separation over which optional ap- 
parent movement can be obtained (De 
Silva, 1929), and the maximum dis- 
tance over which one contour can exert 
an inhibitory influence over another 
(Fry & Bartley, 1935). Moreover, 
it is very interesting to note, and per- 
haps very important that the figure of 
3-4 degrees angular magnitude is just 
about the visual field size represented 
by the measured size of receptive fields 
as revealed by Kuffler (1953). 

The data obtained so far on the 
problem of eye movements tend to 
complicate the model, but do not viti- 
ate it. Since the data on eye move- 
ments are at present incomplete we 
have assumed in the description of the 
summation process (see Figure 5) that 
the eye lags behind the target in the 
hypothetical situation depicted. If 
our eye movement data provide us 
with data contrary to this assumption, 
then the effect would be to augment 
the summation process by keeping the 
growth of neural excitation produced 
by the moving stimulus nearer the 
population of neural elements under- 


going decay following the cessation 
of the stationary stimulus. Again, the 
important point is that the general 
statement of the model can accommo- 
date the complicating factor of eye 
movements. 

Locus of summation. Since the first 
statement of statistical theory by Mar- 
shall and Talbot (1942), they and sub- 
sequent statistical theorists have placed 
the locus of contour resolution in the 
striate cortex. The primary role of 
the projection system has been seen 
as contributing to the peaking of the 
distribution of neural excitation. The 
locus of summation of different dis- 
tributions of excitation, however, has 
been the visual cortex alone. The 
question may be raised, however, as 
to the possibility of summation at 
lower levels. In an effort to ascertain 
the likelihood of this possibility we 
executed an experiment in which the 
stationary stimulus was presented to 
one eye whereas the moving stimulus 
was presented to the contralateral eye 
(Gulick, 1960). The results of this 
experiment suggest that for maximum 
contour facilitation it is necessary to 
present the stimulation sequence in 
such a way as to allow neural inter- 
action and/or summation at the retinal 
level as well as at the cortical level. 
The implications of these data for the 
model involve a possible extension of 
the present view of neural summation 
as it is conceived in current statistical 
theory. 

The interaction (and subsequent 
neural summation) of two temporally 
and spatially separate visual events 
Which is central to our position raises 
in addition certain questions which re- 
quire further investigation. We have 
presented evidence concerning the ef- 
fects of separating in space and time 
the initial exposure of the stimulus ™ 
a fixed position and the movement 
component. It is necessary to ask 
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what effects upon the perception of 
edge of the moving stimulus would be 
observed if the fixed and moving com- 
ponents of presentation differed in 
terms of certain attributes which have 
yet to be studied. For example, sup- 
pose that the stationary stimulus prior 
to movement were exposed out of 
focus, to be followed by the moving 
phase with the stimulus in focus. 
What then would be the change in our 
typical relation between velocity and 
tı as far as perceiving the moving edge 
1S concerned? Similar important ques- 
tions also need to be studied once we 
have completed the instrumentation 
necessary to present stimuli in this 
manner, 

Neural inhibition and facilitation. 
As a final point of discussion we real- 
ize, as must the reader, that our con- 
Ception of contour and its expression 
In neurological and physiological terms 
1s unbalanced. The theory rests prin- 
Cipally upon the concept of neurol sum- 
Mation and says little, if anything, 
about related concepts of inhibition and 
facilitation. In view of the inhibitory 
effects that one contour can have upon 
an adjacent one when both are fixed 
™ position (Fry & Bartley, 1935), we 
are acutely aware that similar effects 
may be present when a particular con- 
tour is displaced relative to its original 
Position, We think it reasonable, for 
example, to assume that an inhibitory 
Mechanism can account for the “damp- 
Ing" of residual retinal activity corre- 
lated with the "trailing" edge of any 
Moving stimulus. In any event we 
assume that future and more sophis- 
ticated expressions of statistical theory 
Will incorporate modes of neural inter- 
action in addition to summation at the 
Cortex, 
Regarding possible future expres- 
s of statistical theory of contour 
th; Can indicate briefly some of the 

gs we believe ought to be done. 


Si 


Additional perceptual studies are 
needed in which systematic investiga- 
tions of the effects of differences in the 
fixed and moving components of the 
stimulus presentation are made. More- 
over, we need to know more about the 
relationships between tracking mo- 
tions of the eye and various conditions 
of stimulus movement. Finally, and 
most important, direct evidence bear- 
ing upon the neurological basis of 
perception is required if we are to 
continue to use with confidence the 
statistico-neurological assumption upon 
which the theory rests. While con- 
gruent with some neurological evi- 
dence, statistical theory of contour 
perception is far from being firmly 
based upon unambiguous neurological 
and physiological data. And, of 
course, the theory cannot find its real 
tests in perceptual data alone. We 
must continue to search for direct 
measures of the integrated activity of 
the visual system itself in relation to 
the data of perception. By so pro- 
ceeding we will determine if statistical 
theory will prove itself as a fruitful 
means of conceptualizing basic prob- 
lems in perception. 
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SUBJECTIVE PROBABILITIES INFERRED 
FROM DECISIONS! 


WARD EDWARDS 


Engineering Psychology Laboratory, University of Michigan 


Theories about how people make 
decisions in risky or uncertain situa- 
tions have come to focus on two con- 
cepts: utility, or subjective value, and 
subjective probability. The status 
of the utility concept is fairly clear 
(for reviews, see Edwards, 1954c, 
1961). But the status of the sub- 
Jective probability concept is quite 
confused, both theoretically and in 
relation to experiments. Too often 
the concept of subjective probability 
has been introduced casually into a 
decision theory which focuses on the 
Concept of utility. In fact, the only 
thorough treatment of a concept of 
subjective probability (quite different 
from the one to be discussed in this 
Paper) is L. J. Savage's (1954) bril- 
liant book. The notion of subjective 
Probability would be necessary in 
PSychology even if it played no role 
in decision theories, since men often 
make judgments about probabilities 
ased on displays of random or non- 
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related paper (Edwards, 1959, 1960) is closely 


random events. But this paper is 
mostly concerned with subjective 
probabilities inferred from decisions 
via some kind of value or utility 
maximization model of the decision- 
making process. 

Psychologists trained in psycho- 
physics tend to think that subjective 
probability is related to objective 
probability in more or less the same 
way that the subjective loudness of a 
tone is related to its objective in- 
tensity. The purpose of this paper is 
to examine the content, merits, and 
limitations of such an approach. The 
discussion will focus on two closely 
related matters. The first is the 
idea of a set of functions relating 
subjective to objective probability. 
The second is whether or not the sub- 
jective probabilities of a set of mu- 
tually exclusive events, one of which 
must happen, should add up to one. 

The paper begins by defining two 
classes of decision theories. After 
some preliminary discussion of utility 
and subjective probability functions, 
it next considers the class of theories 
which result when subjective proba- 
bilities are assumed to add up to one. 
This class turns out to have some 
serious difficulties. A brief review of 
experimental evidence provides em- 
pirical reasons for avoiding these 
difficulties by rejecting additivity. 
Then the paper examines models 
which do not require subjective proba- 
bilities to add up to one. Such models 
require a reformulation of the concept 
of utility; utility scales must have 
true zero points. Finally the paper 
derives two experimentally testable 
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properties which any plausible utility- 
subjective probability model, addi- 
tive or otherwise, must have, and 
presents a model which I find at- 
tractive. 


MAXIMIZATION AND UTILITY 


First, some definitions and assump- 
tions. This paper is concerned with 
decisions of the following kind. A 
decision maker can choose one and 
only one of two or more courses of 
action. Associated with each course 
of action is a finite set of possible 
outcomes, one and only one of which 
will happen if that course of action is 
chosen. Each possible outcome can 
be described by a number, called a 
utility, defined at least up to a linear 
transformation (i.e., measured at least 
on an interval scale); this number 
fully represents the desirability of that 
outcome to the decision maker. If 
more than one possible outcome is 
associated with a given course of 
action, then a well defined external 
event, over which the decision maker 
has no control, determines which 
outcome will occur if that course of 
action is chosen. Most nonsocial 
decisions fit this description. 

The crucial assumption of the class 
of theories considered here is that 
people behave as if they were attempt- 
ing to maximize some quantity— 
perhaps not skillfully, perhaps not 
consistently, but still attempting to 
maximize. Maximize what? The 
following equation defines the con- 
cept of subjectively expected utility 
(SEU): 


SEU = Ð yiu; [i] 


It refers to a course of action which 
has a number of possible outcomes. 
The ;^^ possible outcome has an objec- 
tive value to which corresponds a 
subjective value or utility a; Out- 
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come 7 will occur if a given external 
event occurs; that event may have an 
objective probability, or the objective 
probabilities of one or more of the 
relevant set of external events may 
be undefined. In either case, out- 
come i has a subjective probability 
V; The theories with which this 
paper is concerned simply assert that 
subjects (Ss) behave as though they 
choose, from among the courses of 
action open to them, the one which 
has the largest SEU. For more ex- 
tended discussion of the origin of 
these models, see Edwards (1954c, 
1955). This paper will treat them as 
models of what men do (descriptive 
models) rather than of what they 
should do (normative models). 

Equation 1 introduces formally the 
central concept of this paper: sub- 
jective probability. What is a sub- 
jective probability? It is a number 
between zero and one which describes 
a person's assessment of the likeliness 
of an event. Exploration of several 
more explicit definitions follows. 

This paper assumes that objective 
probabilities exist, are known for some 
events, and are distinct from subjec- 
tive probabilities, though subjective 
probabilities are related to objective 
ones. Some theorists like Savage 
(1954) have asserted that there is no 
such thing as objective probability ; 
all probabilities are subjective. This 
is no place to argue whether or not 
objective Probabilities can be mean- 
ingfully defined. The argument is 
irrelevant. Some events, like the toss 
of a coin or the roll of a die, have well 
established "conventional" probabili- 
ties. Whether these probabilities are 
objective or not, they can meaning- 
fully be compared with the subjective 
Probabilities revealed when people 
make decisions which involve these 
events. 


Of course many events exist which 
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are neither certain nor impossible but 
for which no objective or conventional 
probability can be defined. Example: 
Within the next 24 hours, you will 
drink a glass of wine. Such events are 
called uncertain, while events for 
which objective probabilities can be 
defined are called risky. This paper 
will not consider uncertain events. 
The same decision theory ought to 
apply to decisions whose outcomes 
are contingent on uncertain events as 
to decisions whose outcomes are con- 
tingent on risky events, and the latter 
are more helpful in examining the 
nature of that theory. 

The notion of a set of functions 
relating subjective to objective prob- 
ability is vital to this paper. But the 
theorems proved below are relevant 
also to theories which deny or ignore 
the existence of such a relationship. 
They imply for such theories the 
existence of severe restrictions on per- 
missible relationships between sub- 
Jective probabilities and classes of 
risky events (classes defined by ob- 
Jective probability values). 

From here on, some abbreviations 
are needed. The word *maximiza- 
tion” will be omitted from phrases 
like “the SEU maximization model.” 

he concept of objective probability 
Will be symbolized OP in text; SP will 
Stand for subjective probability. The 
Concept of a set of events of which 
Some one must happen and not more 
than one can happen—that is, the 
Concept of an Exclusive, Exhaustive 
Set of Events—will be symbolized 

E in text, Two classes of SEU 
Models exist, Those in which the 

S of an EEE must add up to a 
Constant will from here on be called 

SEU (additive subjectively expected 
utility maximization) models. Those 
in which the SPs of an EEE do not 
peed to add up to anything in particu- 
EL (though of course the OPs of an 

EE must always add up to one) will 
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be called NASEU (nonadditive sub- 
jectively expected utility maximiza- 
tion) models. 

The subject of transformations will 
come up. Psychologists speak of three 
classes of scales, defined by three 


transformation groups, which are 
relevant here. An interval scale 
permits the linear transformation 


x’ =ax+b, a0. A ratio scale 
permits the scale transformation 
x’ =ax, a 7» 0. An identity scale 
permits only the identity transforma- 
tion x! = x. This paper will use the 
psychological rather than the mathe- 
matical names for these classes of 
scales. 


CLASSES or SEU Mopzrs 


This section will identify two main 
classes of SEU models, ASEU and 
NASEU, and will then examine some 
extraordinary restrictions on the util- 
ity functions which must enter into 
each. The difference between ASEU 
and NASEU models is defined by 

Theorem 1: If only interval, ratio, 
and identity scales are considered, in 
any SEU model in which the SPs of 
an EEE must add to a fixed constant 
(ASEU model) utility must be meas- 
urable at least on an interval scale 
and SP must be measurable at least 
on a ratio scale. In any SEU model 
in which no restrictions are put on the 
sum of the SPs of an EEE (NASEU 
model), utility and SP must both be 
measurable at least on ratio scales. 

Consider the NASEU models first. 
Since the transformations under con- 
sideration are successive restrictions 
of the general linear transformation, 
consider the least restrictive possi- 
bility first. In that case, the trans- 
formation w =au +b, a>0 on 
utility and y’ = oj + d, c > 0 on SD 
are permissible. Consider two courses 
of action, A and B, which have equal 
SEUs. (Continuity assumptions not 
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spelled out here are necessary to guar- 
antee that such events exist and that 
they do not have certain pathological 
properties. Axioms 1 and 2 of the 
mathematical Appendix, plus the 
assumption that there are at least 
three possible outcomes with different 
utilities, would be more than enough.) 
Permissible transformations must not 
change equal SEUs, which imply 
indifference between 4 and B, to 
unequal SEUs, which imply that one 
is preferred to the other. Conse- 
quently : 


X443u4)-XG)4)o 
Ev (Adu (A)=LDy(Byw'(B) [2] 


A; = i" possible outcome of course of 
action A; i = 1, 2, ==, I. 

B; = j" possible outcome of course of 
action.B, j = 1,2, «««,. 7. 

Substituting the transformations 


under consideration into the second 
half of Equation 2 produces: 


25 [GLA] + d) (aula 1] + 0)] 
= X [(GEB;] + d) (au[B;] + b)] 


Multiplying and collecting terms: 
acl yA ul) — 3:4 (B)u(B5)] 
+ [E y(A) — p» v(Bj)] 
+ ad[ E u(Ai) — 2 u(Bj)] 
Tbd[I— J]- 0 [3] 
By hypothesis: 
E y(u) — X V(B)u(3) = 0 


and so the first term of Equation 3 
vanishes. Since both b and d may 
assume any arbitrarily chosen values, 
the only way to guarantee that the 
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sum of the other three terms will be 
zero for any values of b and d is to 
require that the quantity inside the 
brackets in each term be zero. But 
the quantity inside the brackets in 
the second term (and in the third also) 
is explicitly not required to be zero. 
Consequently this set of transforma- 
tions cannot be permissible. Since 
both a and c are required to be greater 
than zero, the only way to guarantee 
that Equation 3 will be correct is to 
require that both b and d be zero. 
"That, of course, means that both 
utility and SP must be measured on 
ratio scales. "Thus the second half of 
Theorem 1 is established. 

Now consider the first half of 
Theorem 1, which is concerned with 
ASEU models. In such models, the 
first and second terms of Equation 3 
are zero by hypothesis. Furthermore, 
it is clear that in such models SPs may 
not have an arbitrary origin, since 
they are required to add up to a fixed 
constant, and consequently d must be 
zero. If dis zero, the other two terms 
vanish, and no restrictions on b are 


necessary. This establishes the first 
half of Theorem 1. 


Possible Relations between Utilities and 
Objective Values 


Luce (1959) has recently pointed 
out some surprising and severe restric- 
tions on possible relationships between 
objective and subjective continua of 
any sort, in a paper of extraordinary 
depth and importance, He asserts 
two Principles to which any function 
relating an independent to a depend- 


ent variable within a theory should 
conform. First: 


Admissible transformations of one or more 
of the independent variables shall lead, via the 
substantive theory, only to admissible trans- 
formations of the dependent variables. (P: 
85.) (Note that a form of this principle was 
used to prove Theorem 1.) 
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Secondly : 


except for the numerical values of param- 
eters that reflect the effect on the dependent 
variables of admissible transformations of the 
independent variables, the mathematical 
structure of the substantive theory shall be 
independent of admissible transformations of 
the independent variables. (p. 85.) 


The first condition means, for ex- 
ample, that a theory in which the 
dependent variable is measured on a 
ratio scale may not require that when 
the arbitrary origin of an interval 
Scale independent variable is changed, 
the origin for the dependent variable 
Must change also, The second, more 
Subtle, condition means that the 
Mathematical form of the function 
relating independent to dependent 
Variables may not be changed from, 
for example, a power function to a 
Simple sum of variables by permissible 
transformations on the independent 
Variables. Both of these principles 
of theory construction seem plausible 
and even necessary. What do they 
imply for utility and SP? 

First, consider interval scale utility 
as dependent variable and ratio scale 
Money (or quantity of any other con- 
Unuously variable good) as inde- 
Pendent variable. The relation be, 
tween these two variables, if it is t 
Conform to the two requirements of 

e previous paragraph, must, ac- 
Cording to Luce's theorems, have the 
orm u($,) = a log $: +b, where a 
and b are arbitrary constants and $; is 
any amount of money. If, as is neces- 
Sary in NASEU models, utility is 
Measured on a ratio scale, the rela- 
'On between it and money must, 
‘ ing to Luce’s theorems, be 
n GS; = a$: (again, a and b are arbi- 
"ary constants). . 

lese restrictions on the possible 
LE of utility functions are astonish- 
bie Y Severe. For example, pe om 
as if for interval scale utilities 
€ are any points on the function 
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relating utility of money to its objec- 
tive value at which that function 
increases (that is, if @ is positive), and 
if the value or b is finite, which it must 
be if the equation is to make sense, 
then the utility corresponding to zero 
dollars (or some fixed positive or 
negative amount, since the origin 
may be displaced by a fixed amount) 
is minus infinity. The utility of nega- 
tive amounts of money (or amounts 
less than the fixed amount) is unde- 
fined. And the form of the utility 
function is necessarily negatively ac- 
celerated; that is, it is impossible to 
find a person who attaches as much as 
twice as much utility to $2x as he does 
to Sx. In short, interval scale utilities 
associated with ratio scale money 
values require ridiculous utility func- 
tions. 

The relation between ratio scale 
utilities and ratio scale money values 
is much less ridiculous. Its most 
important restrictive characteristic is 
that it may have only one inflection 
point. That means that if the func- 
tion is ever positively accelerated, it 
is always positively accelerated; if it 
is ever negatively accelerated, it is 
always negatively accelerated. Even 
this characteristic is pretty restrictive. 
But it is not utterly ridiculous, 
as are the properties of interval 
scale utilities discussed in the previous 
paragraph. P 

These restrictions are so surprising 
that one is naturally tempted to try 
to wiggle out of them by arguing 
against the assumptions which lead to 
them. Luce (1959) has extensively 
discussed these assumptions and their 
possible limitations. This paper will 
not cover the same ground again; 
instead, I wil simply record my 
opinion that at least for interval 
scales Luce's restrictions are inevi- 
table and we must therefore learn to 
live with them. 

Do these principles of theory con- 
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struction place any restrictions on 
possible relations between SP and 
OP? Probably not. We usually 
consider OP to be measured on an 
identity scale; if it is, Luce's argu- 
ments place no special restrictions on 
the mathematical form of possible 
relationships between it and anything 
else. 


GENERAL CHARACTERISTICS OF 
PSYCHOPHYSICAL DECISION 
MopELs 


Before getting down to specific 
issues concernings SPs, I must set 
the stage by discussing some general 
issues and some characteristics which 
all ASEU or NASEU models must or 
should have. 


Identifying Characteristics 


To predict choices from an SEU 
maximization model of any kind, you 
must know the utility and the SP of 
each possible outcome of each avail- 
able course of action. You can then 
substitute these values into Equation 
1, perform some arithmetic, and pre- 
dict that the course of action with the 
highest SEU will be chosen. Such 
predictions will be wrong on occasion. 
This fact can be included in the model 
(at a heavy penalty in additional 
mathematical complexity) by making 
it stochastic, or it can be excluded 
from the model and left to a general 
theory of errors, as has been cus- 
tomary in most of psychophysics. 
This paper does the latter, but it 
seems likely that many of the conclu- 
sions established here for deterministic 
models will apply to stochastic models 
of the same type as well. 

In order to use an SEU model, then, 
you need to know utilities and SPs 
independently of the decision you are 
trying to predict. How can you find 
them? That is a very difficult, much 
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debated question. For the purposes 
of this paper, it is enough to assume 
that some psychophysical method is 
available which gives accurate utility 
and SP measures for any object or 
event of interest, and that you have 
available such information about any- 
one whose decisions you are predicting. 

In what form can such information 
be available? Consider first informa- 
tion about utilities. If money is the 
valuable object in question, presum- 
ably a table could be prepared show- 
ing the amount of utility an .S at- 
taches to a gain of one dollar, of 
two dollars, of three dollars, and so 
on. (My reasons for saying "a gain 
of one dollar" rather than simply the 
amount one dollar or a total fortune 
of $5001 will become clearer later; 
the differences among these formula- 
tions are irrelevant at this point.) 
But the information in that table 
would have to be based on experi- 
ments in which the S received or lost 
particular dollar bills, with particular 
serial numbers. How do we know 
that he will respond similarly to differ- 
ent dollar bills, or perhaps even to 
nickels and dimes instead? Of course 
we don’t. We simply assume that 
(within reasonable limits) the form 
in which S receives the increment of 
one dollar is irrelevant; that is, we 
abstract the property "United States 
currency with an objective value of 
one dollar" from a variety of other 
Properties of the stimuli used in the 
utility measurement experiment, and 
assume that it is the one on which the 
results most significantly depend. 

In the case of utility, the point 
made above is so familiar as to be 
trivial. But what about the case of 
SP? On what properties of events do 
SPs most significantly depend? The 
most obvious characteristic of events 
Which should turn out to be highly 
correlated with SP is, of course, O 
(remember that this paper is re- 
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stricted to classes of events for which 
OPs are well and independently de- 
fined). Unfortunately, unpublished 
data make it perfectly clear that if 
two bets are identical in payoff, OP, 
and all other characteristics except the 
means by which the OP is displayed, 
S may markedly prefer one to the 
other—so much so that he is willing to 
pay for the privilege of using his pre- 
ferred display, even though he loses 
in expected value (hereafter abbre- 
viated EV) by doing so. Within the 
framework of the theories discussed 
in this paper, this finding can be ex- 
plained only by assuming that several 
different SPs may correspond to a 
given OP, depending on the nature of 
the OP display. In short, any at- 
tempt to state SP as a function of OP 
alone is doomed to failure. 

What then is SP a function 
of? Mathematical decision theorists, 
aware of logical difficulties which 
result from thinking of SP as a func- 
tion of OP, often use a quite different 
approach. They think of SP as a 
characteristic of interaction between 
a specific event and a specific person. 

is, of course, means that in order 
to make a prediction they must inde- 
Pendently determine the SP asso- 
Clated with the particular event about 
Which the prediction is made for the 
Person whose behavior is to be pre- 
dicted. Such a strategy for theorizing 
has two major difficulties. The first 
's that it makes little attempt at 
8enerality, Obviously some charac- 
€ristics of an event are important in 

€termining its SP and some are not; 
We need to know which are which. 

Urthermore, if SPs are to be used to 
Predict real-world decision making 
havior (e.g., for designing displays 
m military information processing 
Systems), some set of rules which 
Severn the relationship between event 

"d resulting SP must be found so 


that Predictions can be made without 
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running experiments on each situa- 
tion and person to which the predic- 
tions are intended to apply. 

What we need and do not have is 
some systematic empirical informa- 
tion about various displays of proba- 
bilities. In the absence of identified 
stimulus dimensions relevant to SP 
other than OP, decision theorists are 
in as awkward a position as psychol- 
ogists interested in loudness would be 
in if nothing were known about the 
physics of sound. Of course this 
position can only be made worse by 
ignoring the one stimulus variable 
which we know to be highly corre- 
lated with SP and therefore im- 
portant: OP. But OP is not enough. 
Decision theorists cannot much longer 
get along without empirical informa- 
tion about the psychophysics of proba- 
bility displays. 

Since we have no useful information 
about displays of probabilities, this 
paper can only assume that they are 
important but leave their effects un- 
specified. The following discussion 
assumes that each event to which a 
decision theory might be applied has 
certain as yet unspecified character- 
istics which determine its SP. One 
of those characteristics is, of course, 
OP, whenever OP is defined. The 
others are unknown; this paper will 
call them identifying characteristics. 
Only two assumptions about identify- 
ing characteristics will be consistently 
made. The first is that whenever an 
event is repeated as exactly as pos- 
sible, its identifying characteristics 
are unchanged ; in other words, identi- 
fying characteristics are recurrent, not 
unique. (The contrary possibility 
would make them inaccessible to 
scientific study.) The second assump- 
tion is that it is quite possible to 
have events with different identifying 
characteristics but the’same OPs; this 
simply means that OP is not perfectly 
correlated with identifying charac- 
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teristics. (lf it were, then Theorem 3 
below would imply that SP always 
equals OP, and so the point of the 
notion of identifying characteristics 
would be lost.) 


The SP Function Book 


Mathematically, this paper will 
conceive of SP as a function of OP 
and of identifying characteristics. 
One way to think about this function 
is to imagine a book. Each page of 
this book has on it a page number (the 
quantity r of the Appendix) and a 
function which relates SP to OP. 
These functions need not necessarily 
be continuous. The page number is 
simply a code which refers to a par- 
ticular set of identifying character- 
istics. Thus, given any event, you can 
find its SP by finding out its identi- 
fying characteristics, looking up the 
number corresponding to that par- 
ticular set of identifying character- 
istics, finding the page in the book 
which has that page number, and then 
looking up the SP corresponding to 
the OP of the event in question on 
that page. Mathematicians may find 
this concept and the notation of iden- 
tifying characteristics clearer if they 
examine the axiom system and proofs 
of Theorems 2 and 3 in the Appendix. 
The assumptions of that proof for- 
mally define ASEU models of this 
class. It turns out that the formula- 
tion in terms of a function book, even 
if that book may have an infinite 
number of pages, implies that SP 
always equals OP if additivity of SPs 
is assumed. But the notion of an SP 
function book is appropriate for 
NASEU models, for which no such 
implication exists. 

The idea of the previous paragraph 
should be familiar to psychophysicists. 

This model is analogous to a familiar 
model in auditory psychophysics. 
Perceived loudness of pure tones is a 
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function of both stimulus intensity 
and stimulus frequency. A number of 
different functions relating loudness 
to intensity, each using a different 
value of frequency as a parameter, 
are all an auditory psychophysicist 
would need to make predictions about 
loudness. (He would actually use 
equal loudness contours for the pur- 
pose, but the underlying model is the 
same.) 

The purpose of this notion is to 
provide an orderly way to deal with 
the possibility that more than one SP 
may correspond to a given OP, de- 
pending on display characteristics. 
Of course, I hope that such a book 
of SP functions would have relatively 
few pages, and an orderly structure, 
so that the sequence of pages is not 
arbitrary. But neither of these char- 
acteristics need be assumed in the 
following discussion. 

Incidentally, uncertain events as 
well as risky ones can be provided for 
within the notion of a book of 
SP functions. Presumably uncertain 
events will have identifying charac- 
teristics different from risky events 
(indeed having no definable OP is 
presumably itself one identifying char- 
acteristic), and so pages appropriate 
for such events can be segregated in 
one section of the book. Each such 
page has, as its function relating SP 
to OP, a flat horizontal line running 
from OP 2 0 to OP =1, (More 
precisely, this line is defined only for 
0 «OP <1.) Thus once its identi- 
fying characteristics are known, the 
SP of an uncertain event is completely 
specified. 


ASEU Mopets 


In spite of major differences in the 
axiom systems which lead to them 
and in the philosophy which motivates 
them, for the purposes of this paper 
most of the ASEU models are very 
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similar mathematically and opera- 
tionally to expected utility maximiza- 
tion models (hereafter called EU 
models) like the one proposed by von 
Neumann and Morgenstern (1947). 
The only difference, and it is a crucial 
one, is the use of SPs instead of OPs 
in the models. 


Finite vs. Infinite Event Sets 


Two main classes of possible ASEU 
models are defined by the dichotomy 
between finite and infinite event sets. 
Mathematically, the distinction is 
Simple enough. If we consider all 
Possible outcomes of 10 tosses of a 
Coin, then this is a finite event set. 

we throw a dart at a target, then 
all possible places it can hit form an 
Infinite event set (note that the point 
of the dart need not be a mathematical 
Point with zero area). Obviously a 
Person can encounter only a finite set 
of events in his lifetime, but that is 
"relevant, The point is that he will 
€ncounter many situations in which 
Some one of an infinite set of possible 
events may occur. At first glance, 
It would seem desirable, then, to have 
a decision model capable of coping 
With any member of an infinite event 
Sét—and most decision models are 
Just that. A few theorists (notably 

avidson, Suppes, & Siegel, 1957) 

ave preferred to think about finite 
Svent sets, mostly because they 
Wanted to be able to check all of 
Weir predictions, instead of some 
Dite subset of an infinite set of 
Predictions, As will become clear 
ater, finite event sets immensely 
Complicate the mathematics, and at 
"i Same time reduce the value of the 
ML by making it inapplicable to 
z uations in which the set of possible 
Vents is infinite. Although this 
ce Der discusses finite models below, I 
ir !Sider such models far less interest- 
than the infinite models. 
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Please remember that the notion 
of an infinite event set is not the same 
thing as the notion of a continuum of 
OPs. If all elements of a continuum 
of OPs are realizable by members of 
an event set, then that set is infinite 
(nondenumerably infinite, to be pre- 
cise), but an infinite set of events, all 
with different OPs, may still have the 
property that between any two OPs 
which are associated with members of 
the event set lies an OP which is not. 
In fact, continuity of OPs is not 
assumed directly in Theorems 2 and 
3 below, though it can be deduced as a 
theorem from the nonatomicity prop- 
erty which is assumed. 


Restricted vs. Unrestricted Sets of SP 
Functions 


How many different SPs may be 
associated with different events all 
having the same OP? Or, to ask 
essentially the same question in a 
different form, how many different 
functions may relate SP to OP? Still 
another formulation : how many pages 
may there be in the SP function book? 
Three classes of answers to this ques- 
tion exist for infinite event sets, and 
two for finite ones. For infinite event 
sets, the number of different SPs 
which may be associated with differ- 
ent events all having the same OP 
may be finite or infinite. If that 
number is infinite, it may be de- 
numerable (capable of being set in 
one-to-one correspondence with the 
integers) or nondenumerable (the set 
of all points on a line segment is 
nondenumerable; so is the set of all 
OPs). Itturns out that the finite and 
denumerably infinite cases have the 
same properties, SO only the latter, 
less restrictive case will be considered. 

The formulation of the relation be- 
tween OP and SP in terms of the SP 
function book tacitly presupposes 
that the set of different SPs which 
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may be associated with different 
events all having the same OP is at 
most denumerably infinite. If, as 
the preceding discussion has sug- 
gested, the difference between two 
different events with the same OP 
but different SPs lies in the way in 
which that OP is displayed, then the 
assumption of denumerability is sim- 
ply the assumption that all possible 
ways in which an OP could be dis- 
played could be listed (in infinite 
time). It is difficult to see why any- 
one would resist so mild an assump- 
tion—and yet Theorem 3 will show 
that its consequences are drastic. 
Nevertheless, it seems greedy for a 
theorist to insist that a denumerably 
infinite set of different SPs to asso- 
ciate with a given OP is not enough 
and that he wants more. I prefer to 
accept denumerability and abide by 
its rather drastic consequences. 
For mathematical convenience, 
however, it is expedient first to ex- 
amine a restriction wich applies even 
to the nondenumerable case. It is 
well known that an infinite number 
of different probability measures can 
be defined on a given event set. This 
makes it seem unlikely that any sub- 
stantial formal restrictions exist on 
models of the nondenumerable “any- 
thing can happen” sort. But there 
is at least one. Usually SPs rise with 
OPs. In fact, it is tempting to assume 
that they always do so. Consider two 
events, A and B. If the OP of 4 is 
higher than the OP of B and the SP 
of A is less than the SP of B, call that 
pair of events an inversion. The 
assumption that SPs always rise with 
OPs is an assumption that inversions 
cannot occur. But unless inversions 
exist in infinite numbers and at all 
values of OP, then it is easy to prove 
that SP always equals OP. T heorem 
2 below states a stronger assertion 
from which this one is a self-evident 


consequence. 


Warp EDWARDS 


Theorem 2: Assume that an infinite, 
nonatomic event set is being consid- 
ered, that every event has an SP and 
an OP, that the SPs of any EEE must 
add to one, and that some event can 
be found for which SP ~ OP. It 
follows that for any OP 5,0 < p <i, 
two events 4 and B can be found such 
that p(A) = p and p(B) < p(A) and 
V(B) > ¥(A) + Q where Q is a non- 
infinitesimal quantity (unless the 
maximum difference between SP and 
OP is infinitesimal, or P(A) or p(B) 
is infinitesimally different from Qor 1).2 

Amore precise mathematical formu- 
lation of Theorem 2, and its proof, are 
long, technical, and contribute little 
to the following discussion, I have 
therefore put them in a mathematical 
appendix which follows the list of 
references. The only important as- 
sumption necessary to prove Theorem 
2 which has not already been dis- 
cussed is that of nonatomicity; à 
discussion of it follows Theorem 3, 
for which it is also essential. Its 
content is primarily technical, The 
assumption that all events in the 
event space have both OPs and SPs 
has been discussed. The assumption 
that some event exists for which 
SP # OP is the only reason for even 
thinking about SPs in risky situations; 
without it there is no point in a theory 
about SPs, 

Lemma 4 of the mathematical ap- 
pendix implies some further state- 
ments about subjective probabilities 
in this nondenumerable case. If SP 
is ever different from OP, there must 
be some event or events for which 
the size of that difference (disregard- 
ing sign) is a maximum, or at any 
rate is only infinitesimally different 
from a maximum. In fact, because 
of the additivity of SPs there must 
be at least two such events, one in 


*'This theorem and its proof were formu- 
lated by W. M. Kincaid. 
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which the difference is positive and 
one in which it is negative. Call the 
event with the smallest OP for which 
this maximal difference is present and 
positive M. Event M has an OP 
P(M) and an SP y(M). For any 
Event A, defined (4) = (A) — p(A). 
Then d(M) is the largest positive 
value of d(4) that can occur, and 
d(M) is the largest negative value of 
d(A) that can occur. Now what can 
be said about the difference between 
SP and OP for events other than M? 
For any such Event A, max d(A) 
=d(M), and min d(A) = d(M). 
Lemma 4 shows that for every proba- 
bility p there must be an Event B 
having OP p such that: 


d(B) = wan] 


if 
MEME 
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p(B) < p(M) 


There must also be an Event C having 
P 5 such that: 


d(C) = rac | 2e] 
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nally, it shows that events must 
be aving that same OP $ and all 
Ssible valueg"of y between V (B) and 
relati In other words, in a space 
Sram E SP to OP there is a para s 
Ver, Of points defined by the four 


"tices (0,0) (1, 1), (PUD, YOD) 
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and [^ (M), V(M)]. All points within 
this parallelogram must be realizable 
by events in the event space with 
appropriate subjective probabilities. 
The preceding paragraph leads up 
to Theorem 3. Theawkward property 
exhibited in Theorem 2 applies to 
models which permit a nondenumer- 
ably infinite set of SPs to be asso- 
ciated with any given OP. Next 
consider what happens if any OP can 
be found (other than O or 1) with 
which no more than a denumerably 
infinite set of SPs is associated. 
'Theorem 3: As in Theorem 2, as- 
sume an infinite, nonatomic event set, 
each event having an SP and an OP, 
and assume that the SPs of any EEE 
must add to one. In addition, assume 
that there is some OP other than 0 
or 1 which has the characteristic 
that the number of different SPs 
which occur in conjunction with 
different events all having that OP is 
at most denumerably infinite. It 
follows that SP always equals OP.* 
Again a more precise formulation 
of the theorem, and its proof, are in 
the mathematical Appendix. Certain 
additional assumptions of purely tech- 
nical importance (e.g., continuity at 
zero) are necessary to the proof, but 
all the assumptions which I consider 
at all controversial are stated here. 
At this point it is worth while con- 
sidering the assumption of non- 
atomicity which underlies Theorems 
2 and 3. Suppose I assert “For any 
event which is already included in 
an event set, I wish to include another 
which is defined as the simultaneous 
occurrence of the event in question 
and heads in the toss of a coin which I 
happen to have handy." This asser- 
tion defines a nonatomic event set 
(in fact, it is somewhat stronger than 


3 This theorem and its proof were formu- 
lated by L. J. Savage. W. M. Kincaid turned 
Savage's sketched proof into the form con- 
tained in the Appendix. 
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is necessary to specify a nonatomic 
event set). In other words, the 
property of nonatomicity simply as- 
serts that any given subset may be 
further subdivided. Unless the set of 
events with which you intend to work 
is rigidly defined in advance, it is 
impossible, as I see it, to think of any 
real world event set which is not 
nonatomic. However, this property 
is far from trivial. For example, the 
assumption of nonatomicity directly 
implies that every OP is realizable 
by some event in the event set, and 
that the event set is nondenumerably 
infinite. 

All in all, the characteristics of 
ASEU models based on infinite classes 
of events seem unattractive. We may 
adopt any of three strategies. One 
is to accept and live with these un- 
fortunate properties. That possi- 
bility has already been examined, It 
is now time to mention very briefly 
the second set of possibilities, 


Finile Event Sets 


The arguments given previously 
against finite events sets were, I 
think, Strong enough to argue against 
their serious consideration. How- 
ever, it is interesting to speculate 
further about their mathematical 
properties. It seems quite likely to 
me that analogues of Theorems 2 and 
3 must exist for finite cases. Such 
theorems would define maximum dif- 
ferences between SP and OP as a 
function of number of events in the 
finite event set, spacing of the OPs of 
those events on the OP continuum, 
number of different SP functions 
permitted (if that number is less than 
the number of events in the event 
set), and number and location of 
inversions of ordering between SPs 
and OPs. It seems plausible both 
that such theorems exist and that 
Theorems 2 and 3 are limiting cases 
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as the number of events in the event 
set approaches infinity. Unfortu- 
nately, I have been unable even to 
guess what the form of such theorems 
might be (except for specially con- 
structed cases, such as the case of 
equal spacing of events along the OP 
continuum). As is often the case, 
it looks as if the finite cases are 
mathematically much more difficult 
than the infinite cases. Even in the 
case where only a single monotonic 
function relating SP to OP is per- 
mitted (surely the simplest finite 
case), I have been unable to formulate 
a specific theorem relating number 
and spacing of events in the event set 
to maximum possible difference be- 
tween OP and SP, 


The Third Strategy 


This paper has considered two 
classes of ASEU models based on 
infinite event sets and a class based 
on finite event Sets; all are unattrac- 
tive for various reasons, The third 
possibility is to abandon ASEU models 
altogether and use NASEU models 
instead. Some previously discussed 
considerations about utility functions 
make this option attractive, But the 
NASEU models are so much less 
powerful than the ASEU models that 
I would be very reluctant to turn to 
them on a priori grounds alone. So 
before examining them seriously, it is 
worth While to review the empirical 
evidence which bears on them. 


THE PROBABILITY PREFERENCE DATA 


A rather substantial body of data 
bears on the relation between SP and 
OP. These data come from a series 
of experiments (Edwards, 1953, 1954a, 
1954b, 1954d, 1955) which I have 
collectively called the probability 
preference experiments, 

The basic design of the probability 
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preference experiments was as follows: 
Lists of bets were prepared. Each 
list included eight bets, with OPs 
of winning (or losing, depending on 
the list) running from 1/8 through 
8/8. In various experiments a total 
of eight different lists were used. 
The eight bets in each list all had the 
Same objective EV, so that there was 
no objective reason to prefer one bet 
in a given list to any of the others in 
the same list. All bets in each list 
Were paired with one another accord- 
Ing to the method of paired compari- 
sons. At various times, a total of 
well over 100 college students were 
required to choose between the mem- 
bers of these pairs. In some experi- 
ments, the Ss sat in classrooms, 
looked at slides of the pairs of bets, 
and made choices "as if they were 
gambling." Other Ss, run individu- 
ally, first made imaginary choices, 
then gambled for worthless chips, and 
finally for real money. 

The main results of these experi- 
ments, taken as a whole, were: 
_ 1. Although there were substantial 
individual differences in choices, cer- 
tain patterns of choices showed up in 
all experimental groups and in just 
about all individuals. The two most 
Outstanding of these patterns were 
that Ss usually preferred the bet 
With the 4/8 probability of winning 
from any positive EV list to the other 
ets on that list, and usually preferred 
the bet with the lower probability and 
higher amount of loss in any pair of 
negative EV bets. 
. 2. The preferences cited above, and 
Indeed the complete pattern of choices, 
Were relatively independent of EV 
evel of the list of bets involved, so 
ong as the zero point was not crossed. 

is means that the preferences ob- 
Served should be attributed to the 
: Ps involved, which were constant, 
ather than to the amounts of money, 
Vhich varied from list to list. This 
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fact suggests that for these students 
and these amounts of money, the 
utilitv curve is relatively linear with 
amounts of money, while the SP 
curve is not linear with OP. It also 
suggests that the value of OP from 
which SP deviates most widely is 0.5, 
for positive EV bets. Using the 
assumption that SP = OP and the 
further assumption that the size of 
the just noticeable difference for 
utility is half a cent, it is impossible 
to construct a utility function to 
account for the preferences observed. 
If the size of the just noticeable differ- 
ence for utility is assumed to be zero, 
of course such a curve can be con- 
structed, but it has at least 12 inflec- 
tion points between $0 and +$6. 
The same sorts of statements can be 
made if the data are analyzed S by S. 

3. The complete pattern of choices 
changed radically from positive EV 
to negative EV bets, even though 
exactly the same OPs were used. 
The main change was a strong prefer- 
ence for negative EV bets in which 
the probability of losing was low and 
the amount of possible loss high. 
When this preference was removed 
from the data by crude statistical 
means, the residual preference pat- 
tern was pretty much the mirror 
image of the preference pattern for 
positive EV bets. These findings 
suggest that there is a strong inter- 
action between utility and SP. 

4. The differences among just im- 
agining, gambling for worthless chips, 
and gambling for real money are dis- 
cernible, but not sufficiently serious 
to invalidate any of the conclusions 
discussed above. This means that 
the difficulties which might arise in a 
gambling experiment because S wins 
or loses on each bet before he decides 
about the next bet are much less 
severe than a priori considerations 


might lead one to expect. 
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5. Minor changes in wording, in 
the choice of payoff events and in the 
financial status of Ss do not make any 
detectable differences in their choices. 

. In particular, Ss can be made to win 
exceedingly large amounts of money, 
or made to lose substantial but smaller 
amounts of money, without signifi- 
cantly changing their patterns of 
choices. This finding has been con- 
firmed in a study devoted especially 
to it (Edwards, 1954d). 

Findings 1 and 2 cause me to 
reject the hypothesis that SP = OP. 
"Therefore, they, in conjunction with 
the considerations discussed in the 
previous section, cause me to reject 
all EU and ASEU models. The im- 
plications of Findings 3, 4, and 5 for 
a NASEU model are important; they 
will be discussed below. 


NASEU Mopets 


What are the mathematical proper- 
ties, if any, of a NASEU model? 
Take a look at Equation 1. It defines 
NASEU as well as ASEU models. 
But what does the operation of addi- 
tion called for by the sigma in Equa- 
tion 1 mean for a NASEU model? 
For an ASEU model, that sigma sim- 
ply derives from the usual definition 
of a probability measure, and reflects 
the additivity property of that meas- 
ure. Fora NASEU model, it has no 
such simple mathematical justifica- 
tion. A variety of psychological 
arguments about its plausibility are 
available. This paper will not press 
them; evidence to be presented later 
argues against this form of NASEU 
model, and I do not advocate or 
defend such a model. The purpose of 
this section is to introduce some ideas 
which are a part of the model which 
I do defend. For the time being, there- 

fore, it is enough to say that no law of 
men or mathematics forbids the multi- 
plication of utility and nonadditive 
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SP values, the algebraic addition of 
the products, or test of the hypothesis 
that people make decisions in such a 


way that they in fact maximize the 
resulting sums. 


Ratio Scale Utilities 


However, Theorem 1 indicates that 
a NASEU model must use a ratio 
scale measure of utility. A ratio 
scale of utility implies that utility 
has a true zero point. Where is it? 
Only one answer is plausible: where 
you now are. Zero utility is your 
current position, and you can never 
leave it. 

This is not exactly a new idea. 
The first implicit use of it was by 
Mostellr and Nogee (1951), who 
used the EU model to make a utility 
scale for money (and, incidentally, 
Proposed nonadditive SPs as an 
alternative explanation for their re- 
sults). Mosteller and Nogee used a 
gambling situation in which real 
money changed hands during the 
course of the experiment. But in 
determining the utility of a given 
amount of money, they simply found 
an indifference point which involved 
that amount of money without mov- 
ing the origin to take into account S's 
financial status at the time they 
measured it. Such a procedure can 
Yield a classical utility function only 
if the form of that function is in- 
Variant up to a linear transforma- 
tion under movements of the origin 
(through which all utility functions 
conventionally pass) along the func- 
tion. Only a very limited class of 
functions, of which the most familiar 
member is the straight line, has this 
property, and the utility curves 
Mosteller and Nogee obtained don't 
look like any member of that class. 
Consequently those curves in fact 
defined zero utility as the present 
monetary status of each S, and so 
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were of the type that this section 
advocates. 
. The first self-conscious use of an 
idea like that proposed here was 
made by Markowitz (1952). He 
suggested that the zero point of the 
utility scale be taken as S's customary 
financial position, and that the form 
of the function changes when that 
customary financial position changes. 
He used this idea to remedy a defi- 
ciency in Friedman and Savage's 
(1948, 1952) previous account of 
gambling and insurance buying. The 
only difference between Markowitz's 
Position and this one is that Marko- 
witz defines zero utility as the custom- 
ary position, while this paper defines 
It as the current position. 
_ Although this conception of utility 
is novel and quite different from the 
traditional one, it is neither internally 
contradictory nor absurd. Nor does 
it imply that people cannot change 
their habits as they change their 
financial status. Such changes in 
habits simply show up as changes in 
the shape of the utility curve. Fur- 
thermore, this point of view is not 
empirically contentless. If it is to be 
useful, people must not make serious 
Changes in their subjective value 
Scale for a commodity as a result of 
relatively small changes in the amount 
of that commodity which they possess. 
his proposition is in principle test- 
able, though testing it experimentally 
is difficult. However, Findings 4 and 
of the probability preference experi- 
Ments (see above) suggest that it 
May be correct. 


Tndifference Curves and a Property of 
ASEU Models 


Loss of the addition theorem for 
Ps, even though compensated by a 
true zero for utility, is a very serious 
Oss. Can a model of the type de- 
Ned by Equation 1 mean much with- 
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out an addition theorem for SPs? 
Yes. Equation 1 implies a strong 
decomposability property which has 
considerable significance both mathe- 
matically and empirically. The fol- 
lowing discussion exhibits this prop- 
erty and derives two theorems, one 
of which can be empirically tested. 
These theorems have the additional 
advantage of being true for all 
plausible models defined by equations 
like Equation 1. This means that an 
empirical verification of them would 
be strong evidence for the usefulness 
of the class of maximization models 
with which this paper is concerned, 
while an empirical disproof would 
necessitate search for a quite different 
kind of theory of decision making (e.g., 
one based on variance preferences). 
The following discussion deals ex- 
clusively with bets of the following 
form: Event E has probability p of 
occurring. If it occurs, S receives an 
amount of money x, which is greater 
than 0. If it does not occur, no 
money changes hands. Any such bet 
can be described as a point (x, p); 
that is, all such bets can be com- 
pletely represented by a plane bounded 
at p=0, p=1, and x = 0. Now, 
consider the bets (x, p) and [ (x + e)p] 
where c'is a small positive number. 
A great deal of evidence, plus common 
sense, suggests that most Ss would 
prefer the second of these two bets 
to the first. Similarly, [x, ( + /)] is 
preferable to (x, P) where f is another 
small positive number. If both of 
these assertions continue to be true 
as e and f approach zero for any 
values other than zero of x and p 
(a reasonable idealization), and if the 
direction of preference reverses for 
any negative values of e and f (an- 
other reasonable idealization), it fol- 
lows that a set of indifference curves 
can be drawn in the xp plane. An 
indifference curve in this application 
is simply a function which defines a 
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Fic. 1. Hypothetical indifference map 
among simple bets which have “no money 
changes hands” as one of the two possible 
outcomes. The rectangles are described in 
Theorems 4 and 5 of the text. 


set of bets among all of which S is 
indifferent (see Coombs & Beardslee, 
1954; Edwards, 1954c for details). 
A set of indifference curves is called 
an indifference map; Figure 1 shows 
such a map. The properties to be 
discussed below are general properties 
of a certain family of indifference 
curves and have applications to the 
use of indifference curves in economic 
theory; these applications will be 
discussed at the end of this section. 

What does the SEU model have to 
say about such indifference curves? 
Since each curve identifies bets among 
all of which S is indifferent, all bets 
on a given curve must have the same 
SEU. In short, in utility and SP 
measures the equation of each in- 
difference curve c must be: 


Welle = ke [4] 


Of course there is a different value of 
k for each curve. 
But indifference curves are stated in 
physical, not subjective, units. W sie 
implications does Equation 4, m 
therefore SEU maximization models, 


have about the form of indifference 
curves stated in physical units? It 
is obvious that if any single indiffer- 
ence curve is selected, it is possible 
to find a transformation on the prob- 
ability and money axes which will 
make it conform to Equation 4, which 
is of course the equation of a rectangu- 
lar hyperbola. In fact, any set of two 
indifference curves could be made to 
conform by proper choice of trans- 
formations. But that uses up all the 
arbitrariness implied in Equation 1. 
To put it another way, the trans- 
formations necessary to make these 
two curves conform to Equation 4 
completely specify the utilitv and 
subjective probability functions for 
this S. Now, if Equation 4 is to be 
Correct, it must turn out that all 
other indifference curves in the xp 
plane must be turned into rectangular 
hyperbolas by the same utility and 
SP functions which turned the first 
two into rectangular hyperbolas. This 
is an exceedingly strong condition on 
the family of indifference curves, 
The following proof shows that an 
indifference map satisfies this require- 
ment if and only if a quite specific 
relationship exists among the slopes 
of the indifference functions passing 
through any four points at the corners 
of a rectangle whose sides are parallel 
to the coordinate axes. 

The theorem assumes that an in- 
difference map is given each of whose 
functions is continuous, monotonic 
decreasing, and has continuous deriva- 
tives. The utility and SP trans- 
formations are assumed strictly mono- 
tonic increasing, They are also 
assumed single-valued, which means 
that the events displaying the OPs 
used must all have the same identify- 
ing characteristics if Theorems 4 and 5 
are to be applicable. Two values of 
money, xı and x» and two values of 
Probability, pı and ps, are chosen 
at random (or Systematically; it 
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doesn't matter how they are chosen 
so long as both amounts of monev 
are greater than zero and both proba- 
bilities lie between zero and one); 
they of course jointly define four bets. 
Let us call these four bets -1, B, C, 
and D, as in Figure 1. From now on, 
Sa means the slope of the indifference 
curve which passes through Bet -1. 

Theorem 4: If any kind of SEU 
model, additive or otherwise, is cor- 
rect, then S4Sc = SpSp, and if this 
condition on the slopes of an indiffer- 
ence map is satisfied, and if at least 
one indifference function has a nega- 
tive slope, then it follows that for 
these data a NASEU model or some 
more powerful SEU model is correct.* 

The proof of Theorem 4 is in the 
mathematical appendix. 

The property of all SEU maxi- 
mization models, including NASEU 
models, which is established in 
Theorem 4 is theoretically elegant 
and in principle empirically testable. 
No such empirical test is likely to 
take place, however; differentiation 
of empirical curves requires more 
precision than is usually obtainable 
in experiments using human Ss. 
Fortunately, the microscopic property 
described in Theorem 4 has a macro- 
Scopic equivalent, which can be tested 
experimentally. 

Look at Figure 1 again. Choose an 
amount of money Vi, significantly 
Smaller than xı Extend a vertical 
Ine from x’, to the indifference curve 
which passes through D; call the 
Intersection point M. From Z extend 
a vertical line until it intersects the 
indifference curve which passes through 
4l, and a horizontal line until it inter- 
Sects the indifference curve which passes 
through C; call the two points thus 
identified E and G. A horizontal line 
Can be extended from E and a vertical 


Ap: H a r - 
This theorem and its proof were formu 


Ate, " 
ted by George Minty. 
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one from G, intersecting in a point F 
and thus completing another rec- 
tangle. Will Point F lie on the indif- 
ference curve which passes through 
Point B? It turns out that it will if 
and only if the indifference map has 
the property expressed in Theorem 4. 
It is probably intuitively apparent 
that the property of Theorem 4 can 
be derived from this one; Theorem 4 
is simply the limiting case as the 
difference between x; and x’; becomes 
infinitesimal. This property can be 
expressed formally as 

Theorem 5: If any kind of SEU 
model, additive or otherwise, is cor- 
rect, then if a second rectangle is 
constructed as in Figure 1 and three 
of its vertices are chosen to lie on the 
same indifference curves as three of 
the vertices of the original rectangle, 
then the fourth vertex of the new 
rectangle must lie on the same in- 
difference curve as the fourth vertex 
of the original rectangle. Further- 
more, if the vertices of the new rec- 
tangle are connected by straight lines 
to the vertices of the old rectangle, 
the slopes of the connecting lines 
will be related by the equation 
SapSce = SarSpu, where Sag sym- 
bolizes the slope of the line connecting 
Point 4 with Point E in Figure 1. 
If these conditions are satisfied, and 
if at least one indifference function 
has a negative slope, then it follows 
that for these data a NASEU model 
or some more powerful SEU model is 
correct.® 

To prove Theorem 5, start by look- 
ing again at Equation 4 With 
notation taken from Figure 1, it says 
that: 


g(x)0h(p) = g(xh(pi) [C5] 
g^)h (p'e) g(x)h p?) [6] 
gh) = g()h(p) CT] 


5 This extension of Minty's theorem was 
formulated by L. J. Savage. 


ll 
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where g(xi) is the utility of xi, A(p1) 
is the SP of pi, and so on. Now 
multiply the left half of Equation 6 
by the left half of Equation 7 and 
divide by the left half of Equation 5, 
and similarly for the right halves of 
these equations. The results are: 


g(x'z)h(p's) = g(x)h(p) [8] 


Equation 8 is what the first half of 
'Theorem 5 asserts. 

Now note that if a line is drawn 
from A to E and similarly for the 
other three pairs of vertices, each of 
these lines is a chord of an indifference 
curve. The slope of any of these lines 
is given by: 


Pa 
Sij = x';—xi [9] 
where: 
4,j = 1,2 
A little algebra will show that 


SaxSoé = SprSpu, which is what the 
second half of Theorem 5 asserts. 
This result is a close macroscopic 
analogue of Theorem 4. In fact, 
Theorem 4 follows immediately from 
it if x^; is allowed to approach xi, for 
then the four chord slopes approach 
the slopes of the indifference curves at 
the vertices of the original rectangle. 
This proof of Theorem 5 is only a 
proof of necessity; a proof of suffi- 
ciency very similar to that given in 
the Appendix for Theorem 4 exists, 
but will not be given here. 

The property of NASEU (and 
other) models expressed in Theorem 5 
can be tested by trying to perform the 
construction specified in the first 
half of Theorem 5. The relationship 
among slopes specified in the second 
half of Theorem 5 is tautologously 
true for all pairs of parallelograms 
with sides parallel to the coordinate 
axes, and so provides no additional 
information concerning the NASEU 
model. An experiment designed to 


test this property, and so to test this 
class of models, is now under way. 

Although the foregoing discussion 
was expressed in terms of bets, it is 
actually a quite general property of 
certain kinds of indifference curves. 
The properties of Theorems 4 and 
5 clearly apply to any indifference 
map whose curves have the form 
S(x)g(y) = k. But they also apply to 
any indifference map whose curves 
have the form f(x) + g(y) = k. (To 
prove this, simply replace the multi- 
plications in Equations 5-7 by addi- 
tions and the divisions by subtrac- 
tions. The proof now proceeds as 
before, using additions instead of 
multiplications and subtractions in- 
stead of divisions up to but not in- 
cluding Equation 9. A similar modi- 
fication of the proof of Theorem 4 is 
possible.) This additive form of the 
indifference curves corresponds to an 
indifference map concerned with two 
goods which are independent, rather 
than competing (like butter and 
margarine) or completing (like right 
and left shoes), Thus Theorems 4 
and 5 provide an operational test of 
the hypothesis that two goods are 
independent of each other for a given 
indifference map—so far as I know, 
the only operational test so far sug- 
gested which can be applied directly 
to an indifference map. Of course 
this test is applicable only if the 
existence of at least interval scale 
utilities is assumed. 


The Weighted SEU Model 


Most of the argument of this paper 
has been devoted to the proposition 
that for any of the class of models 
considered here except NASEU models 
any function relating SP to OP may 
only be linear. The probability 
preference data indicate that it can- 
not be linear. Or do they? 

An old familiar finding in psycho- 


A 
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physics is that the form of any sub- 
jective scale depends on the methods 
used to determine it. The same prop- 
osition may be true for SP and utility 
scaling. The probability preference 
experiments were a rather indirect 
method inferring utility and SP 
properties from gambling choices. 
What results would more direct 
methods yield? 

In the case of utility of money, it 
seems unlikely that direct methods of 
psychophysical scaling will be useful. 
The kind of question which would 
have to be asked would sound like 
"What amount of money is twice as 
valuable as two dollars to you?" It 
seems unlikely that any S would give 
an answer systematically different 
from "Four dollars," no matter what 
his utility function for money might 
look like. The difficulty is that 
money has an invincibly numerical 
Character, and most people would 
probably respond to mathematical 
properties of these numbers rather 
than to any subjective values they 
might have. Inany case, the evidence 
is continuing to accumulate that for 
small amounts of money utility is 
linearly related to dollar value—in 
which case psychophysical determina- 
tion of utility scales is not very likely 
to be interesting. Of course psycho- 
physical methods could be used to 
Scale the utility of other objects 
which have a less numerical character, 
but it isn’t obvious how such experi- 
Ments would shed much light on 
general properties of utility functions, 
because in such experiments the con- 
cept of objective value is difficult to 
define. 

In the case of SP, on the other hand, 
Some psychophysical data exist. Shu- 
ord (1959) performed an experiment 
n which he showed Ss square matrices 
Made of 400 thumbtacks, some red 
some green. The Ss were re- 


Wired to estimate the relative fre- 


quency of each kind of thumbtack 
in the display after a brief look, and 
succeeded in estimating with remark- 
ably good accuracy. Relative fre- 
quency in a matrix isn't exactly the 
same thing as OP, but is very closely 
related. Shuford’s experiment sug- 
gested one in which I used these 
square matrices as the basis for 
probabilities in the probability prefer- 
ence experiment situation. I dis- 
played two such matrices and asso- 
ciated a bet with each. The Ss were 
required to estimate the relative 
frequency of each color in each of the 
two matrices, and then were required 
to choose one of the two bets. A 
random device (two 20-sided dice) 
chose one location on the matrix 
associated with the chosen bet, and 
the color of the thumbtack at that 
location determined the outcome of 
the bet. Preliminary results show 
that the estimates of relative fre- 
quency are excellent, just as in 
Shuford's experiments, but that the 
choices made among the bets are 
distorted. In short, at almost the 
same moment the Ss say that SP is 
and is not equal to OP. Something 
peculiar is clearly going on. 

My interpretation of this finding is 
that people perceive OPs correctly, 
but that they misuse them. They 
simply prefer to bet at some OPs 
rather than others, even though they 
perceive these OPs quite accurately. 
This hypothesis can be translated into 
mathematics by supposing that they 
attach to each event a weight w; which 
expresses the relative desirability or 
undesirability of the probability dis- 
played by that event. A model of 
this sort is only a minor variation on 


Equation 1: 


WSEU = X, (pawa)ui [10] 


Mathematically, the WSEU model 
is exactly the same as the NASEU 
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This paper shows that if the sub- 
jective probabilities of a set of events 
of which some one must happen and 
not more than one can happen must 
add up to a specified constant (usu- 
ally taken as one for convenience), 
then utility can be measured on an 
interval scale and subjective prob- 
ability on a ratio scale. But if 
subjective probabilities do not have 
this additivity property, then both 
utility and subjective probability 
must be measured on a ratio scale. 

If subjective probabilities do have 
the additivity property, if the deci- 
sion model in which they are used is 
in principle applicable to any con- 
ceivable set of events, and if the 
number of different subjective proba- 
bilities which may occur in conjunc- 
tion with a given objective probability 
is no more than denumerably infinite, 
then whenever objective probabilities 
are defined subjective probability 
must equal objective probability. 
This theorem and another mean that 
the concept of subjective probabilities 
which must add to one, and decision 
models based on that concept, cannot 
be very helpful in explaining decision 
making; a large amount of data indi- 
cates that subjective probability can- 
not always equal objective probability. 

One alternative is to use a decision 
model in which subjective probabili- 
ties need not add up to any particular 
constant. Such a model is possible; 
its properties are explored. One 
property is that in such a model, 
utility must have a true zero point. 
The only reasonable zero point for 
utility is where you now are; this 

notion is not new to decision theory. 

Any form of the expected utility 
maximization hypothesis (with addi- 
tive or nonadditive subjective proba- 
bilities) implies a strong relationship 
among the slopes of indifference 

curves for simple bets. This relation- 


ship leads to an experimental test of 
the applicability of models of this 
class. If the result of this test should 
be negative, further theories about 
decision making in risky situations 
should probably include the concept 
of variance preferences. 
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APPENDIX“! 


lhis appendix contains precise state- 

Dents of the assumptions and proofs of 

€orems 2, 3, and 4 from the body of 

the article, The proofs of Theorems 1 
and 5 are contained in the text. 

heorems 2 and 3 assume the existence 

a probability space S with events 


dit footnotes 


Arg : 
i See body of article for cre 
h theorem. 
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A, B, C, +++, having objective probabili- 
ties p(A), p(B), p(C), +++; the measure p 
has the conventional properties. Asso- 
ciated with each Event A is an element 
r(A) of a space R; r(A) is called the 
identifying characteristic of Event A. 
Two or more events (indeed, large classes 
of events in the models discussed in this 
paper) may have the same identifying 
characteristics. A function T(r, p) is 
defined for each r and p such that for 
some Event A, r(A) = rand p(A) = $. 
The subjective probability (A) of the 
Event 4 is then defined by the equation: 


WA) = TErGD, 80] 


Several assumptions, mostly of tech- 
nical importance only, are necessary. 
They are stated here with little discus- 
sion; the important ones are discussed 
in the text. 

Axiom 1: p is completely additive. 

Axiom 2: S is nonatomic with respect 
to p. This means that if p(A) > 0, then 
there exists some Event B contained in A 
(a subset of A) such that 0<p(B) <p(A). 

Axiom 3: For each r for which T(r, 0) 
is defined, T (r, 0) =0, and im T(r, p)=0 

P 


uniformly in r. That is, given € > 0 
there exists a ô > 0 such that T(r, p) <e 
whenever p < à and T (r, p) is defined. 
Axiom 4: W(A) is a probability 
measure. Specifically: (a) For ACS, 
o<sy(A)<1., (b V(S)- 1 and 


VG) — 0, and (c) If A, B,C S and 
Af|B-0, then V(4 U B) = ¥(A) 
+ y(G). 


There is some redundancy in the state- 
ment of Axiom 4; it was included for 
clarity and to permit Theorem 4 to be 
proved without the use of Part ¢ of the 
axiom. 

Some lemmas prepare the way for the 
proof of Theorems 2 and 3. 

Lemma 1: The probability measure y 
is completely continuous with respect to 
p. and completely additive. 

From Axiom 3, it follows immediately 
that to any number e > 0 there corre- 
sponds a number 6>0 such that V (4) <€ 
for any Event 4 such that p(A) « à. 
This is what complete continuity means. 


132 


Now let A, A», As, ++- bea countable 
sequence of disjoint events such that 


ti H H 
U A, =A. Since p is completely addi- 


n-l 


tive, we have p(A) = Ð p(A,) and 


n=l 


(setting Ba = U Ax) lim &(B,) = 0. It 

kn 
follows from the foregoing discussion 
that lim V(B,) = 0. Since p(A) exists, 


nn 


so does Y (A), and since it follows from 


n= 


n-1 
Axiom 4 thaty (4) = 2 V(A4) + V(B.) 
- 


for all z, we conclude that 


¥(A) 
= X ¥(A,), and thus that V is com- 


nel 
pletely additive 
Lemma 2: S is nonatomic with respect 
to y. 

Proof: Lemma 1 makes it possible to 
apply the Radon-Nikodym theorem (see 
for example Halmos, 1950, p. 128) to 
infer that y has a probability density 
with respect to b. If A is any event 
with ¥(A) > 0, it must include a Sub- 
event A’ such that P(A’) > 0 and the 
density of V with respect to p is positive 
on A’. Since S is nonatomic with respect 
to p by Axiom 2, there is an event 
B C A' such that 0 < p(B) < 5(A^);con- 
sequently also 0 < (B) <¥(A’) < y(A). 
This proves the lemma. 

Lemma3: Let f bea (Borel-measurable) 
function defined on S such that 0 < fw) 
<S 1 for w in S, and let Q; and Qe be 
completely additive nonatomic prob- 
ability measures. Then there exists an 
Event Z such that: 


[reo = (LZ), 
S 


f Sdu) = U(E) 
S 
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This result is a specialization of one 
established by Dvoretsky, Wald, and 
Wolfowitz (1951a, 1951b), representing 
an extension of an earlier theorem of 
Lyapounov (1940). 

It is convenient at this point to inter- 
pret the objective and subjective prob- 
abilities of an Event A as the rectangular 
coordinates of a point [p(A), v(A)]ina 
Euclidean plane. Then if p and y are 
not identical measures there must exist 
an Event H for which V1) = p(H), 
and the points (0,0), [p(H), ¥(H)], 
CoH), V GT)], and (1, 1) (where H is 
the complementary event to H) form a 
parallelogram. The following lemma 
asserts that there is at least one event 
corresponding to every point within the 
parallelogram. 

Lemma 4: Let // be any event, and 
let a and b be any two numbers such that 
OSaS1and0 Sb< 1. Then there 
exists an Event C such that b(C) =ap(H) 
+ bi — p(H)] and ¥(C) = ap(H) 
+ bf! =W) 

Proof: Let g be the characteristic 
function of H (that is, let g(w) be 1 for 
win H, and 0 otherwise). Then by 
Axioms 1 and 2 and Lemmas 1 and 2 
the hypotheses of Lemma 3 are satisfied 
with Qi = p, Q = y, and f(w) = ag(w) 
Tb1- £(w)] The Event C is then 
the event whose existence is asserted by 
Lemma 3. 5 

Now it is possible to prove 

Theorem 2: If there is an Event H for 
which Y (H) = DH), then for any value 
of p within the limits 0 < P <1 there 
exist Events A and B satisfying the 
relations p(A) = b. b(B) < p(A), (B) 
> ¥(A) + P(1— P)|yr) — p(H)|. 

Proof: It is sufficie 
case p(H) € 1/2, p x 
=h 70; parallel ar 
the result fo 

Suppose 


nt to consider the 
1/2,5 (H) — p(H) 
guments establish 
r other cases, 

first that 0 < P SX p(H). 
By Lemma 4 (with a = 0 and b = P/ 
[1 — b(H)] there exists an Event 4 
Such that (4) = P, while y(A) 
= P[1 -—VOD[- 5 1)]. Similarly, 
taking a = P/p(H), b — 0 yields the 
conclusion that there exists an Event B’ 


with P(B') P. v(B’) = PV (H)/p(H). 
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Subtraction yields the inequality: 


V^) = V) 
TRN hP 
P(HA)LL — pn] 
_ Next suppose p(I) < p 31/2. The 
Event A may be selected as in the fore- 
going case, while an appropriate Event B^ 
corresponds to a = 1, b = [P — p(H)]/ 
[1 — p(I)]. Thus in this case also: 


V(B) -p (A) =v (HT) 
P—p(H) 


> hP(i — P) 


tcpa UD 
I-H) h 
Pm Ig ran 


Now let the values of a and b corre- 
sponding to the Event B’ be multiplied 
by the factor: 


hP(1 — P) a] 
p(B’) 


Lo designate an associated event by B. 
ince k < 1, it follows that: 


ZEE 


p(B) = kp(B’) < P = P(A) 


while 


V(B) S iy (B^) 
=1/2Q (B^) H- AP (1 —P)+4(4)] 
>hP(1—P)+(A) 


Thus the Events A and B have the de- 
Stred properties and the proof is complete. 

heorem 2 could be strengthened, in 
that the expression hP(1 — P) could be 
replaced by any continuous function 


P) satisfying the inequalities: 
G(P) < hP ] 
Pun — »0D] 
0 < P € min [2UD), 1 — puni 


BU ue E 
)« max [P (1), 1 — BD] 


min Coz), 1 — p(H)] 
€ P € max [p(H) 1 — bun] 
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na — BP) 
BD — 8D) 
max [(H), 1 — p(H)] SP <1 


GIF) € 


The proof is changed only by the re- 
placement of AP(1 — P), where it ap- 
pears, by G(P). 

Lemma 4 also makes it possible to draw 
a strong conclusion about the cardinality 
of the set of identifying characteristics. 

Theorem 3: If there exists an Event H 
for which V(UI) # p(H), the set R of 
identifying characteristics is nondenum- 
erably infinite. 

Proof: Let the hypothesis of the 
theorem be satisfied for some H. Sup- 
pose for convenience that: 


p(H) S 1/2 < 1 — pl) 
Then 
ap(H) + [1 — p(H)] = 1/2 
if 
b = [1/2 — ap) [1 — 80D] 


Since the foregoing relations imply that 
osbsi if 0< a1, Lemma 4 ap- 
plies, and for each value of a in the indi- 
cated range there exists an Event Ca such 
that p(Ca) = 1/2, while: 


1/201 —V UD) J+ H) — 802] 
1—p(H) 


Y (Ca) = 


Thus distinct values of @ are associated. 
with distinct values of V(C;). Since 
p(C«) is the same for all a, the identifying 
characteristics r(Ca) must likewise be 
distinct for distinct a’s. Thus a subset 
of R can be put into one-to-one corre- 
spondence with the unit interval, and 
the theorem is established. 

The general mathematical setup under- 
lying Theorem 4 is explained in the body 
of the paper; it is not altogether the 
same as that underlying Theorems 2 and 
3. Theorem 4 is concerned with bets of 
the form "with probability p you win 
$x; with probability 1 — p no money 
changes hands." Specifically, it as- 
sumes the existence of indifference curves 
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in a plane of such bets, and inquires 
about the restrictions which a NASEU 
model puts on such curves. t . 

Axioms 1 and 2 of this Appendix still 
apply, Axiom 3 is irrelevant, and Parts 
a and b of Axiom 4 are assumed (but not 
Partc). The following additional axioms 
are assumed. 

Axiom 5: For the particular situations 
in which Theorems 4 and 5 are to be 
tested, V(p) is a continuous, differenti- 
able, monotonically increasing func- 
tion of p. 

Axiom 6: For the particular situations 
in which Theorems 4 and 5 are to be 
tested, u(x) is a continuous, differenti- 
able, monotonically increasing func- 
tion of x. 

Axiom 7: The model asserted by 
Equation 1 of the text applies. That is, 
in this situation Bet A is preferred to 
Bet B if and only if: 


V(ba)u(xa) > V(bs)u(xg) 


Now we perform the construction de- 
scribed in the paper and illustrated in 
Figure 1. If S4 is the slope of the in- 
difference curve passing through Bet A, 
we can prove 

Theorem 4: S485; = SpSp. 


First comes a proof that if the in- 
difference curves can be transformed 
into a form fitting Equation 4 of the 
paper, then the first half of Theorem 4 
must hold. If Equation 4 can be written 
for any indifference curve, there must 
exist two functions g and } such that 
the equation for each indifference 


curve 
can be written: 


g(x)h(p) = ke [A1] 

Differentiating Equation 11 gives: 

g'(x)h(p)dx + g(x)h' (p)dp = 0 
dp _ . g(x) hp) 


dx gx) (p) 


oF a P(x)O(p) = S(x, p) [A3] 
dx 


[A2] 


which simply says that the slope of the 
indifference curve at any point may be 


written as a product of a function of x 
along and a function of p alone. Now 
it is tautologous that: 


$(x)0(p) — B(x2)0 py) [A4] 
P(x1)0(p1) — $(x)0p) 


Substitution from Equation A3 reduces 
Equation A4 to: 


But by definition (xj, b:) defines Bet A, 
(x2, pe) defines Bet B, (xa, pı) defines 
Bet C, and (xi, pı) defines Bet D. So 
Equation A5 may be rewritten: 


SaSc = SpSp [A6] 


The proof so far has shown that if 
Equation 4 is correct then Equation A6 
follows. In short, Equation A6 is neces- 
sary for the correctness of Equation 4, 
and so of Axiom 7. It remains to show 
that it is also sufficient. The statement 
of Theorem 4 given in the text of the 


Paper specifies that Equation A6 is 
sufficient as well as necessary for the 


correctness of Equation 4 only if at least 
one indifference curve has a negative 
slope; this restriction is necessary to rule 
out transformations on b and x which 
are monotonically decreasing rather than 
monotonically increasing. To prove 
sufficiency given at least onc indifference 
curve with a negative slope, it is neces- 
sary to exhibit transformations which 
actually fit Equation 4. 

To start, choose any point (xo, po), 
xo >0,0 <p) <1. Now let 


u = exp — f S(x. bo)dx CA7] 


70 


D 1 
V = exp S(xo, bo) zr dp [A8] 
m S (Xo, p) PE 
Now it is necessary to prove that 
these transformations fit Equation 4 if 
they fit Equation A6. 


First, differentiate 


Equations A7 
and A8: 


B 


—————— A all 


— 
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p Substituting into .V9 from X6: 
dias [esp Sie m f^ - E 3 a ubstituting into .\9 from .\6 
po TGs P dy - y r BIN Po) dp 
s 
x [5s TEN = 5 u| du u S(x, p) SQ. Po) dx 
ALI Ix IL | 
dim [esp - T S(x, sous du u S(x. p) dx 
à But 22 = s(x, p) by definition. Tl 
x [-S(x. po)dx] ut. = S(x, p) by definition. here- 
So fore, 4 = -* which is the differential 
dy ed s y S(xo, Po) dp [A9] equation for the family of rectangular 
du u S(x, po) S(xo. p) dx , hyperbolas. 
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OBJECTIVE MEASU 
PROBABILITY 


RES OF SUBJECTIVE 
AND UTILITY! 


GORDON M. BECKER? 


General Dynamics Corporation, Groton, Connecticut 


This paper describes several meth- 
ods for measuring subjective proba- 
bility. Some of the methods require 
a priori measures of utility, others 
measure both utility and subjec- 
tive probability simultaneously. The 
measures, derived from choice be- 
havior, do not necessarily reflect the 
subject’s (S's) conscious opinions or 
feelings. The subjective probability 
estimates may be useful as measures 
of confidence, bias, prejudice, or 
expectancy. 

The basis for the methods is a 
modification of Edwards (1955) Sub- 
jective Expected Utility (SEU) model. 
It is assumed that: 

1. A person confronted with a 
choice between two or more alterna- 
tives selects the alternative that 
appears, to him, to offer the highest 
average satisfaction (SEU). The 
SEU of an alternative, A, is defined as: 


SEU(A) = x V(A )u(A;) 
iel 


! The major ideas discussed in the first half 
of this paper were presented at the Thirteenth 
Annual Meeting of the Eastern Psychological 
Association, Atlantic City, April 3, 1959, 
This investigation was supported in part by 
Research Grant M-2348(RI) from the 
National Institute of Mental Health, United 
States Public Health Service, in part by the 
General Dynamics Corporation, and in part 

tek Corporation. 

a en asked at TEMPO, General Elec- 
tric Company. 

The dior is indebted to Ward Edwards 
for his help in making these assumptions ex 
plicit, for encouraging the author to rie 
these ideas, and for suggesting that the T jor 
consider the effects of various degrees ol l Bs 
formation about utility upon the determina- 
tion of subjective probability. 


where: 
A; is the ith possible outcome of course of 
action.A; $2 1, 2, ««5, 7. 


Ta 

V(A;) is S's subjective probability that Out- 
come 4; will occur. 

1 (A,) is the satisfaction (utility) S expects 
to derive if Outcome A; occurs when .S 
takes course of action 1. 


2. The sum of the subjective proba- 
bilities of a set of mutually exclusive 


and exhaustive events equals unity,’ 
KAA 


I 
x (4) — 1 


3. Utility and subjective 
bility do not interact. 

4. The satisfaction (utility) S asso- 
ciates with an event is measurable. 

5. The measures of S's utility reflect 
the utilities operating during the 
subjective probability measurement 
session, i.e., the utilities have not 
changed during the period intervening 
between the utility and the subjective 
probability measurement sessions, 

In addition to the above, the as- 
sumptions inherent in the method 
used to measure S's utilities must be 
added. If the utility measures are 
based upon the assumption that 
utility differences of all magnitudes 
are discernable (Coombs & IKomorita, 
1958; Edwards, 1955; Luce & Raiffa, 
1957; Savage, 1954; Siegel, 1956), i.e., 
that any difference in SEU leads to 
100% preference for the alternative 
with the higher SEU, this assumption 
becomes part of the present model. 


proba- 


‘This is a modification of Edwards SEU 
model in that he did not require that the sum 
of subjective probabilities equal unity. 
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However, if the utility measures are 
based upon the assumption that 
preferences are stochastic when utility 
differences are small (Coombs, 1959; 
Davidson & Marschak, 1959; David- 
son, Suppes, & Siegel, 1957; Luce, 
1959; Suppes & Walsh, 1959), then 
the stochastic assumptions become 
part of the present model. 

Figure 1 shows the SEUs of two 
alternatives Yiand Ys as functions of 
subjective probability. The figure 
illustrates a situation in which you, 
as S, must choose between going to 
the airport (Yi) or going to the Green 
Room (Y; In this example you 
want to meet Bill, who will be either 
(Bi) at the airport, or (B) at the 
Green Room. The satisfactions (utili- 
ties) you expect to derive from the 
four possible outcomes of this situa- 
tion are given by $n, dis, G21, and $»». 

The model predicts that you will 
choose Y, (the airport) whenever your 
Subjective probability that Bill is at 
the airport, (Bi), is greater than Xi. 
Note that X, is the subjective proba- 
bility at which the two alternatives 
have equal SEU and intersect. (If a 
stochastic model is used to determine 
utilities, Y; will bea range rather than 
a point) Note also that the sum 
of the subjective probabilities, y(B) 
ay V (B,), is unity, indicating that you 
are sure Bill will be either at the air- 
Port or the Green Room, not at both, 
and not somewhere else. 


MEASURING SUBJECT PROBABILITY 


The subjective probability method 
Proposed in this paper is based upon 
the reverse of the above interpreta- 
tion, ie, instead of predicting the 
alternative S chooses, the method 
€stimates S's subjective probability 
x the time of choice. From Figure i, 

is evident that when S chooses 

i ernative Y,, his subjective proba- 

ity y (B4), must be between X; and 


WHERE: 


$(22)> eu) g2n> 2) 


(22) $(22) 
ean YI) (y2) aun 


(Qu L. ean 
eua (12) 


S(BI) 100 X; 50 0,00 
s(B2) 000 X2 .50 1.00 


Hypothetical problem illustrating 
SEU model. 


Fic. 1. 


unity, whereas when S chooses Ys, 
V(Bi) must be between X; and 0. 
Knowledge of both the utilities and 
the choice permits one to place upper 
and lower bounds on V (B1). 

The range between the upper and 
lower bounds can be decreased by 
having S choose from among several 
alternatives. In general, if N differ- 
ent utilities are known, it is possible 
to construct JV? different two-outcome 
alternatives. 

Let (a; p;a;) represent an alter- 
native, A, that pays ai when p 
occurs and a; otherwise. If (a1) 
> u(a:) > -+> > (aq) it follows that 
(a, p; a) is preferred to all other 
alternatives at all probabilities except 
y(p) = 1 at which point all alterna- 
tives of the form (a1, p; X) are equally 
preferred and except at the point 
V(p) = 0 when all alternatives of the 
form (x, p; a1) are equally preferred. 
Alternative (a1, P; d») is preferred to 
all alternatives of the form (ai, p; x) 
when u(x) € u(as) and is preferred 
to all alternatives of the form (x, p; a2) 
when u(x) € " (aj). However, there 
are N — 1 alternatives of the form 
(x, p; a1) which are preferred to 
(ay, p; a2) when V(p) is small, but 
are not preferred when V (p) is large. 


138 Gorpon M. BECKER 


(I) 


9 (II) 
SEU 


(IT) 


D. 

GF EH 
SUBJECTIVE PROBABILITY OF X 

Fic. 2. Example illustrating relationship 


between SEU, subjective probability, and 
alternative choices. 


Hence, these N — 1 alternatives inter- 
sect (a1, p; a) at various (p). In 
general, alternatives of the form 
(ai, p; aj) intersect N — i alternatives 
of the form (x, p;y), when u(x) 
< u(a;) and u(y) > u(aj). Figure 2 
shows the nine two-outcome alterna- 
tives that can be formed from three 
distinct utilities and the nine subjec- 
tive probability intersection points 
that result. In general, N distinct 
utilities will result in N? two-outcome 
alternatives and N?(N — 1)?/4 inter- 
sections. 

Let the subjective probability at 


the point at which Alternative A inter- 
sects Alternative B be y¥(A, B)— 
where A = (a, p; a^) and B= (b, p; b’). 
Since the SEUs of 4| and B are equal 
at the intersection point, it follows 
that: 


Yula) + [1 — o(p) Jue’) 
= ¥(p)u(b) + [1 — y (p) Ju’) 


and: 


u (a^) — u(b/) - 
u(b) —u(b) —u(a)--u (a) 


Itisevident that y (.1, B) 2y(B, A), 
ie., that reversing the order of the 
alternatives does not change the sub- 
jective probability at the intersection 
point. 

However, reversing the payoffs 
(utilities) within an alternative will 
change the crossover point. Let 
Alternative A’ be (a', p;a) and let 


Alternative B’ be (b, p; b). Itfollows 
that: 


V (A, B) 


(a) — u (b) 


y (A ; BY u (b) —u(b) —u(a^) 3-u (a) 


AVAILABLE PAYOFFS 


SUBJECTIVE 
EXPECTED 
UTILITIES 

OF 

PAYOFFS 

AND 

ALTERNATIVES 


-ALLPOSSIBLE 1 HL 2 8 
INUMBER Of Z 
EEEE j 4 
RANKS FOUR 1 2 
REQUREO | THREE 1 z 4 
TWO 1 
SUBJECT ONE 1 2 3 
| MAXIMUM NUMBER OF OBTAINABLE SUBJECTIVE PROBABILITY INTERVALS 


Fic. 3. The number of obtainable subjective probability intervals as a function of the number 
of required choices and the number of available payoffs. 
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Comparing (1, B^) with YC, B) 
it will be seen that VCI, B) = 1 
— (Al, B). 

It is also evident, from the above 
equations, that P(A, -l’) = 1/2, ie., 
reversing the payoffs within an alter- 
native will result in the reversed 
alternative intersecting the original at 
V(p) = 1/2. In general, NCN — 1)/2 
of the intersections of the two-out- 
come alternatives formed from N 
unique utilities will intersect at ¥(p) 
= 1/2. One need know only the 
ranks of the two utilities used in these 
alternatives to determine which alter- 
native has the higher SEU at subjec- 
tive probabilities greater (or less) 
than 1/2. 

, The maximum number of unique 
intersections of two-outcome alterna- 


tives that can be achieved from N 


unique utilities is therefore: 


Un — Tn — 28) 3-3 
n 


Che maximum number of subjective 
probability intervals that can be de- 
termined from N unique utility points 
is one more than the number of unique 
intersections, as shown in Figure d. 


Utilities Measured on Interval or Ratio 
Scales 


One method for estimating the 
Subjective probability operating at 
the time of S's choice is to have S 
select one of several alternatives. For 
example, if S must choose between 
\lternatives E, D, and F shown in 
“igure 2 and he chooses Alternative E, 
his subjective probability, according 
to the model, must be between ED 
and unity, Whenever S selects Alter- 
Native F, his subjective probability 
Must be between DF and 0; and when 

selects Alternative D, his subjective 
Probability. must be between ED and 
"d If the utilities, 6 (D), & (1D), and 

I) are measured on a ratio or in- 


terval scale, the numerical values of 
these probability bounds can be deter- 
mined from the equations for the 
intersections of the alternatives. 

The range between the upper and 
lower bounds can be reduced by hav- 
ing S again choose between alterna- 
tives. If S's original choice is Æ, 
Alternatives £ and C might be pre- 
sented. Given the initial preference 
E, if S now selects E over C his sub- 
jective probability must be between 
unity and EC, whereas a choice for 
C at this point would indicate a sub- 
jective probability between EC and 
ED. Similarly, Alternatives E and F 
might be presented if the original 
choice were D, and Alternatives B and 
F might be presented if the original 
choice were /. Given three utilities, 
it is possible, from two responses, to 
estimate which of six intervals con- 
tains S's subjective probability. A 
third choice would permit the experi- 
menter to determine which of eight 
subjective probability ranges contains 
S's subjective probability. 

Another technique for obtaining 
increased precision requires only a 
single response. S ranks his prefer- 
ences for many alternatives instead 
of simply selecting the one he prefers 
most. Figure 3 shows the maximum 
number of intervals obtainable with 
from one to six utility points. Notice 
that the number of intervals depends 
upon the depth of the ranking re- 
quired of S and the number of 


utility points used. 


Utilities Measured on an Ordinal Scale 


If only the ranks of the utilities are 
known, at least 2N — 2 ranked proba- 
bility intervals can be obtained. As 
indicated earlier, knowledge of the 
ranks of two utilities is sufficient in- 
formation to enable the experimenter 
to construct a pair of alternatives 
such that S's choice indicates whether 
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his subjective probability is greater 
than, equal to, or less than 1/2. 
Alternatives B and C, and E and 7, 
and G and JI (Figure 2) are examples 
of such alternatives. 
In order to obtain 2N — 2 ranked 
probability from N different utilities 
when only the ranks of the utilities 
are known, construct a set of 2N — 3 
two-outcome alternatives. Let one 
alternative, .|N, consist of winning 
the highest utility, a, when Event p 
occurs, and the lowest utility, 7, 
otherwise, i.e., AN = (a, bin). From 
each of the other N — 2 utilities con- 
struct two alternatives, Let one 
alternative from each utility be a sure 
thing, and let the other alternative 
consist of winning the next lower 
utility when p does not Occur, e.g., 
from Utility i construct Alternatives 
II and IJ where [| = (i, p; 7) and 
IJ = (i, p; j) and where the rank of 
u(i) is one higher than the rank of 
u(j). It is evident that Alternative 
II intersects AN at a higher sub- 
jective probability than the inter- 
section of AN and IJ, i.e., (AN, II) 
> (AN, IJ). Moreover, it is evi- 
dent that VAN, BB) > V(AN, BC) 
> WAN, CC) > ++. > WAN, MM) 
where u(a) > u(b) > > u(m) 
> u(n). Two additional alternatives 
can be formed to complete the set of 
2N — 2 alternatives. One of these 
alternatives, BA = (b, b; a), will in- 
tersect AN at the highest subjective 
probability, and the other alternative, 
NM = (n, p; m) will intersect AN at 
the lowest subjective probability. 
Therefore, the 2N — 3 alternatives 
formed in this manner intersect A4 N 
at 2N — 3 different subjective proba- 
bilities. The ranks of the subjective 
probabilities at the intersection points 
are strictly determined by the ranks 
of the utilities. Hence, from N differ- 
ent utilities it is possible to construct 
a set of alternatives such that 2N — 2 
subjective probability intervals can 


Gonpox M. BECKER 


be ranked from a knowledge of only 
the utility ranks. 

A second technique for measuring 
subjective probability when only the 
utility ranks are known makes it 
possible to measure both utility and 
subjective probability simultaneously. 
Let the N utilities be ranked as be- 


fore: u(a) > u(b) > > u(n). 
Construct the following N two-out- 
come alternatives: AN = (a, pin) 


BM = (b, p; m); CL = (c, T 
NA = (n, p;a). These N alternatives 
intersect, at most, at (N*—2N-4-2)/2 
distinct Probability points to form 
(N?—2N+4) /2 probability intervals. 
Consider the case of four utilities, 
ranked a highest and dlowest. Let the 
four two-outcome alternatives, con- 
structed in accordance with the above 
plan, be: 4 = (a, ~;d);B = (b, pic); 
C= (c p;b);and D = (d, p;a). The 
difference between Utilities a and b, 
Le, (a) — u(b), must be less than, 
equal to, or greater than the difference 
between Utilities c and d. But when 
u(a) — u(b) « u(c) — u(d), the ranks 
of the SEUs of the Alternatives 4, B, 
C, and D are: 


D>C>B>Awhen 0 <vip)< D 
D=C>B>A when Miers 5 
C>D>B>A when ¥(C, D) «y(p) «v (B, D) 
C>D=B>A when V(b) 24 (B, D) 
C>B>D>A wheny(B, D) «y (p) «1/2 
C=B>D=A when ¥(p)=1/2 
B>C>A>D when <¥(p)<y(A, C) 
B>C=A>D when ¥(b) = (A, C) 
B>A>C>D when ¥(A, C) «V (p) «y(A, B) 
A=B>C>D when V(b) 2V(A, B) 
42 B» C» D when (A, B) «V (p) <1 


Note that only 8 of the 24 possible 
rankings of 4, B, C, D occur when 
u(a) — u(b) < u(c) — u (d). 

When u(a) — u(b) — u(c) — u(d), 
the SEU ranks are: 


1/2 


D>C>B>A when 0 Xy(p)«1/2 
A=B=C=D when vp) 51/2 
A>B>C>D when 1/2 «y (p) <1 


Note that all 24 rankings of 4 By GD 
are possible when x(a) — u(b) = u(c) 


3» .. A-8 OF 


-——— 
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— uld), but only two rankings occur 
when ¥(p) is different from 1/2. 
When s(a) — u(b) > ule) — u(d), 
the SEU ranks are: 
D>C>B>A when 0 <P(p)<¥(A, B) 
D>C>B=A when V(p =(A, B) 
D>C>A>B when ¥ (A, B) <¥(p) <¥(A, C) 
D>C=A>B when V(p-v(4, C) 
D»A»C»Bwheny(A, C) «(p) «1/2 


D—A»C-B when y(p)=1/2 
A>D>B>Cwhen 1/2 <y(p)<¥(B, D) 
A>B=D>Cwhen ¥(p) =¥(B, D) 


A>B>D>C when ¥(B, D)<¥(p) «v(C, D) 
A>B>C=D when ¥(p)=¥(C, D) 
A>B>C>D wheny(C, D) <¥(p)S1 


Note that only 8 of the possible 24 
rankings of .1, B, C, D occur when 
u(a) — u(b) > u(c) — u(d). 

From the above, it is evident that, 
when ¥(p) is different from 1/2, six 
rankings of the alternatives occur 
only when x(a) — u(b) > u(c) — u(d) 
and six other rankings occur only 
when u(a)—u(b) <u(c)—u(d). Thus 
if S assigns any of these 12 distinctive 
rankings when y(p) is not 1/2, the 
experimenter can immediately rank 
the subjective probability of p operat- 
ing at that time and, at the same time, 
determine whether u(a) — u(b) is 
greater or less than 4(c) — u (d). If 
u(a) — u(b) is greater (less) than 
u(c) — u(d), 10 of the 24 rankings 
will never occur, and if the utility 
differences are equal then 22 of the 24 
rankings will never occur when (p) 
is different from 1/2. Therefore when 
subjective probabilities different from 
one-half are used, the experimenter 
can immediately determine from S 
rankings, whether (a) — u(b) is 
greater than or less than x(c) — '! (d), 
can immediately order the six sub- 
jective probability intervals obtain- 
able from the four utilities, and ean 
simultaneously determine which of 
the six intervals contains the (p) 
Operative at the time of S's ranking. 

Thus, given only the ranks of the 
utilities, it is possible to order the 
utility differences and the subjective 
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probability intervals from S's rank- 
ings of the alternatives. 

However, the above method does 
not allow for stochastic choice. If 5 
ranks the alternatives consistent only 
with (a) — u(b) > u(c) — u(d) at 
one time, and consistent only with 
u(a) — u(b) < u(c) — u(d) at another 
time, the method will lead to con- 
tradictory results. If people show 
such inconsistencies in their rankings 
it will be necessary to modify the 
method by extending it to include 
stochastic rankings of alternatives. 


PRELIMINARY TEST OF THE 
GENERAL METHOD 


'Two preliminary experiments wer 
conducted to explore whether or not 
Ss vary their selection of alternatives 
in accordance with the difficulty ofa 
discrimination task and the objective 
probability of an event. A visual 
discrimination task was selected be- 
cause it did not contain any explicit 
probability measures, because it pro- 
vided a fairly critical test of the 
method, and because it permitted 
unambiguous control of the stimulus. 
Both experiments were conducted at 
Connecticut College? using the same 
23 undergraduate women. 

Each S was given a set of 45 prob- 
lems from which her utility scale for 
points could be determined by linear 
programing (Davidson). A problem 
consisted of a choice between two 
alternatives. Each alternative con- 
sisted of winning 1500, 1000, 500, or 
0 points or of losing 500 or 1000 
points if Event ZEJ occurred; and 
winning or losing one of the above 
point scores if Event ZOJ occurred. 
The subjective probability of ZEJ 
(ZOJ) was assumed to be equal to 1/2. 
Linear programing solution could not 


5 The assistance of Mortimer H. Appelzweig 
and George O. Moeller in providing Connecti- 
cut College Laboratory facilities and students 


is gratefully acknowledged. 


Gorpon M. BECKER 


TABLE 1 
DrRivED UriLITY SCALES 
Objective Point Offers 
Subject : 7 = $ 
—1000 —500 0» 500» 1000 1500 
OM E —1875 — 1000 0 500 | 1375 1025 | 625 
AG — 1061 —643 0 500 | 928 1142 71 
MS —2250 —1000 0 300 | 1000 1250 250 
JN —1500 —833 0 500 1000 1667 167 
BW —1333 —667 0 500 | 750 1083 83 
GH —1000 —750 0 500 | 750 1500 125 
KT —1333 —667 0 500 833 1333 107 
cs — 1000 —500 0 500 875 1125 125 
LB —1250 —500 0 500 750 1250 250 
EJ —1300 — 100 0 500 | 800 1200 100 
L. 


a The zero points of all utility scales were arbitrari 
units of all utility scales were arbitrarily set by assignin 


utilities of zero and 500 objective points. 


be found for many Ss. The scales for 
10 of the Ss for whom solutions were 
found are shown in Table 1. Al- 
though the subjective distances be- 
tween the point offers differed from 
the objective distances for most Ss 
the rank-order of the points and the 
rank-order of the probabilities at 
which alternatives were preferred did 
not differ from the objective for the 
alternatives used in the two ex- 
periments. 


Experiment I: Visual Discrimination 


In the first experiment, S chose one 

of five alternatives on the basis of 

her ability to discriminate which of 

two line clusters contained the greater 

number of lines. Unknown to S, one 

of the two clusters always contained 

100 lines. The other contained 75, 80, 

85, 90, 95, 100, 105, 110, 115, 120, or 

125 lines. The lines were 1/8 to 1/4 

inch long and were randomly dis- 
tributed in an irregular, egg shaped 
pattern about 3/4 inch high and 114 
inches long. Both clusters were of 
the same irregular shape and area. 
Sixty pairs of these clusters were re- 
produced in mimeographed test book- 


ily set at zero objective points. The 
g 500 units to the distance between the 


lets. Each booklet contained the 
same 10 pages with six problem pairs 
per page, but the order of the pages 
was randomized from one S to the 
next. The top cluster in eacl 
pair was labeled QUG and t 
cluster was labeled QOB. Two of the 
QUG areas contained 75 lines, two 
contained 80, 85, 115, 120, and 125 
lines. Four of the QUG areas con- 
tained 90, 95, 105, and 110 lines. 
Each of the 28 QUG areas was paired 
with a QOB area containing 100 lines, 
An additional 28 problems reversed 
the QOB and QUG areas, Four 
Other problems were also presented 
in which QUG and QOB each con- 
tained 100 lines, 

.S was told that the test was de- 
Signed to test her ability to estimate 
the number of Objects in an area. 
The instructions informed S that she 
would be paid 1 cent for every 500 
points she won, and that she would 
lose 1 cent for every 500 points she 
lost. However, S was guaranteed to 
net at least 1 dollar regardless of her 
losses. She could, if she selected 
properly, win up to 2 dollars, S won 
points by choosing one of the five 


h problem 
he bottom 


Two 


Dzcisiox 


alternatives described below for each 
pair of QUG and QOB areas. 


Alternative CIJ: You win 1500 points if 
there are more lines in area QUG than there 
are in area QOB, and you lose 1000 points if 
there are more lines in area QOB. 

Alternative MEQ: You win 1000 points if 
there are more lines in area QUG than in 
QOB, and you lose 0 points if there are more 
lines in QOB. 

Alternative ZEJ: You win 500 points for 
sure. 

Alternative XIP: You win 1000 points if 
there are more lines in area QOB than in 
QUG, and you lose 0 points if there are more 
lines in QUG. 

Alternative FEH: You win 1500 points if 
there are more lines in area QOB than in 
QUG, and you lose 1000 points if there are 
more lines in QUG. 


These alternatives are shown in 
Table 2, together with the alternatives 
offered in the next two experiments. 

Two lines were drawn to the right of 
each problem in the test booklet. 
One line was labeled Side A, and the 
other was labeled Side B. S wrote the 
name of the alternative she preferred 
on the Side A line. If she wished, 
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she wrote the same name on the 
Side B line, or she wrote the name of a 
different alternative. S was told that 
only one of the alternatives indicated 
on the two lines would be considered 
in computing her score. This alterna- 
tive was determined by flipping a coin 
labeled Side A on one side, and 
Side B on the other. If S wrote the 
same name on both lines, she was 
sure to have that alternative count 
towards her score. If she wrote 
different names on the two lines, one 
or the other would be selected, each 
with a 50/50 chance. This procedure 
distinguished between real preferences 
for an alternative and those forced by 
a choice between equally desirable 
alternatives. This method provides 
a seven-interval probability scale as 
shown in Table 3. 

In order to avoid changing S's 
utility for points during the experi- 
ment, S was not told which area con- 
tained more lines, which side of the 
coin appeared face up, or her point 


TABLE 2 
ALTERNATIVES OFFERED THE SUBJECT BY PROBLEM AND CONDITION 


Condition 


Problems 


S chooses column alternative. .... +--+" ° cu 


Alternatives 


MEO |ZEJ| XIP | FEH 


1500 | 1000 | 500 0 |— 1000 


Discrimination If top area contains more lines, 
S receives 
If bottom area contains more —1000 0 | 500 | 1000 1500 
lines, S receives 
c S chooses column alternative... «s VUK WUB XEH YOF 
First set If ZEJ card is drawn, S receives 1500 | 1000 500 | — 1000 
Probability Deck | If ZOJ card is drawn, S receives | —1000 500 | 1000 1500 
S chooses column alternative. . - + | on F FHJ 28) | DJO 
fron IF XIH card is drawn, S receives | — 1000 o | 1000 | 1500 
Probability Deck If XEH card is drawn, S receives 1500 | 1000 o | —1000 


144 


TABLE 3 
PROBABILITIES ASSOCIATED WITH DISCRIMINATION RESPONSES 
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score until she had completed all 
three phases of the experiments. The 
utility scale, the discrimination and 
the probability deck problems, dis- 
cussed in the next section, were all 
administered during one group session 
that lasted 114-2 hours. 

If the SEU model and the method 
for measuring subjective probability 
are appropriate, we expect S to choose 
a high probability alternative (CIJ- 
CIJ) whenever she is sure that QUG 
contains more lines, to choose FEH- 
FEH whenever she is sure QUG con- 
tains less lines, and to vary her choice 
in the seven intermediate steps shown 
in Table 3 as her confidence in her 
ability to discriminate varies. (One 
also expects a high correlation between 
S's confidence, 
subjective probability associated with 
her choice, and the difficulty of the 
Since the ranks 
Obability levels 
objective prob- 
10 Ss for whom 
utility scales were obtained (Table 3) 
a high correlation would be expected 
ranks of the objective 
probabilities associated with alterna- 
tives selected by § and the difficulty 
of the discrimination task. The mag- 
nitude of the difference between the 
number of lines in the two areas pro- 
vides an objective basis for ranking 
the difficulty of the discrimination, 

hus, if a significant correlation be- 
tween the ranks of the stimulus differ- 
ences and the ranks of the probability 
intervals associated with the alterna- 
tives is found, it would support the 
Notion of using the Proposed method 


to measure Subjective probability, 
confidence in isi 


ambiguity 

Results. Two Kendall coefficients 
of concordance were calculated for 
each S, one for four replications of the 
five problems involving 100, 105, and 
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110 lines in QUG and 100, 105, and 
110 lines in QOB; and the other for 
the four replications of the five prob- 
lems involving 90, 95, and 100 lines 
in QUG and QOB. 

Spearman average rank correla- 
tions obtained from these coefficients 
are shown, for each S, in Table 4. 
The "best" estimates of each S's 
choice, on each problem, obtained by 
averaging the S's four choices on 
each problem, were used to compute 
a coefficient of concordance for the 
group as a whole. The average 
Spearman rank correlation. obtained 
from the coefficient is labeled Total in 
Table 4. Note that 29 of the 46 
correlations are significant beyond the 
-05 level and that the totals are sig- 
nificant beyond the .01 level. 


EXPERIMENT Il: OBJECTIVE 
PROBABILITY 


The results of a second pair of tests, 
which employed explicit probabilities 
rather than line areas, are also shown 
in Table 4. S was told that one card 
Would be drawn from a deck contain- 
ing 100 cards. Each card in the deck 
was labeled either ZEJ or ZOJ in the 
first set, and either XIH or XEH in 
the second set. S was told exactly 
how many ZEJ (or XIH) cards were 
in a particular deck. In the first set, 
She selected one of the four "Prob- 
ability Deck" alternatives shown in 
Table 2 and described below: 


VUK: You win 1500 points if ZEJ is drawn, 
and you lose 1000 points if ZOJ is drawn. 
WUB: You win 1000 points if ZEJ is drawn 
and you win 500 points if ZOJ is drawn. 
XEH: You win 500 points if ZEJ is drawn 
and you win 1000 points if ZOJ is drawn. 
OF: You lose 1000 points if ZEJ is drawn 
and you win 1500 points if ZOJ is drawn. 


ate objective probabilities at which 
mee alternatives intersect are shown 
Table 5. In the second set, she 


TABLE 4 


CORRELATIONS BETWEEN OBJECTIVE 
STIMULUS SCALE AND SUBJECTIVE 


PROBABILITY 
Task 

Discrimination Probability 

(5 alternatives) (4 alternatives) 
Subject 
One area One area 
inim tuo. | amea toa.) SE | SS 
(100-105-110)| (90-95-100) 

1 38 .04** eee | 10** 
2 24 Jute | 748" | .78** 
3 30 66** ,81** | .88** 
+ .84** sS BI] 897" 
5 .46* .08 .Sa*e| AQI 
6 0e :61** .87** | .87** 
7 —.03 —.20 468*€| Same 
8 24 .59** ,84** .80** 

9 39 on .86** | .30* 
10 .64** 30 Base | Tot 
11 ao 99 .89** | .80** 
12 orf Jut [.85*5|.78** 
13 ADM .15**  |.84** | .75** 
14 .68** .59** 9i**.| .86** 
15 415 aAT* S9] 89> 
16 Bul .39 .56** | .66** 
17 61** 18 .89**| .617* 
18 255" .15** | .86** | .86** 
19 10 39 .88** | .82** 
20 olf -67** 85**| EU 
21 46* .89** 84** | .82** 
22 sor" 53* i872 | JOR” 
23 —.10 53* 90% | Tot 
Total .834** STL [vtt | ae 


* Significant at .05 level. 
** Significant at .01 level. 


selected one of the four alternatives, 
shown in Table 2 and described below : 


QJF: You lose 1000 points if NIH is drawn 

j 1500 points if XEH is drawn. 

n 0 points if XIH is drawn 

and you win 1000 points if XEH is drawn. 
ZB] : You win 1000 points if XIH is drawn 

and you win 0 points if XEH is drawn. 
DJQ: You win 1500 points if XIH is drawn 

and you lose 1000 points if NEH is drawn. 


The probabilities at which these alter- 
natives intersect are shown in Table 5. 


and you win 150 
FHJ: You wi 
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lach set of problems was con- 
structed from 19 different decks. 
These decks contained the following 
numbers of ZEJ-ZOJ(XIH-XEH) 
cards: 0-100, 10-90, 15-85, 20-80, ---, 
75-25, 80-20, 85-15, 90-10, 100-0. 
Thirty problems were constructed 
from each set of 19 decks by present- 
ing 11 decks twice and 8 decks® once. 
Kach set of 30 problems was included 
on a separate page in the test 
booklet after the discrimination. ex- 
periment problems and the utility 
scale problems. 
5 again indicated the one, or two, 
alternatives she preferred, using the 
same Side A and Side B procedure. 
She was not informed of her point 
Score on either set, nor of her point 
Score on the discrimination or utilitv 

problems during the experiment. 
Results. Kendall tau correlations 
between the objective probability 
ranks of the decks and the ranks of 
the alternatives selected by S, assum- 
Ing S is rational, were computed for 
each S on each of the two problem 
sets. The results are shown in Table 
4. The correlations labeled Total are 
average Spearman rank correlations 
Obtained from Kendall's coefficient of 
concordance. The four entries in 
fach row in Table 4 were obtained 
p the same S during the same 

rimental session. 

mone that all 46 tau correlations 
jr ed for the probability prob- 
e aie significant beyond the .05 
; and 45 of the 46 are significant 
yond the .01 level. The average 
“Pearman rank correlations are .87 
me set and 16 for the second 
—— hese results indicate n 
lity to select high subjective Eo s 
vënia alternatives for highly IKely 
» and low subjective proba- 


6 ^. 
the ae eight decks presented only once were 
10 7900/0, 85/15, 80/20, 70/30, 30/70, 15/85, 
' and 0/100. 


bility. alternatives for events of low 
probability. 


Summary of Experimental Results 


The empirical results in both the 
discrimination and probability tasks 
support the use of the proposed 
method as a means for measuring sub- 
jective probability, confidence, and 
bias, and suggest that the method 
may provide a means for measuring 
stimulus ambiguity. The high corre- 
lations also indicate that it is an 
acceptable procedure for use in indi- 
vidual prediction. The method pro- 
vides an estimate of subjective proba- 
bility from a single response. 

The fact that the correlations are 
less than 1.0, however, indicates that 
Ss do not consistently choose the same 
response to the same stimulus. This 
may reflect fluctuations in utility 
during the experiment, or may result 
from obtaining subjective probability 
measurements that were more precise 
than those employed by 5, or the 
results may be due to real changes in 
S's expectancy from one stimulus 
presentation to the next. The fluctua- 
tions may further have been due, in 
this experiment, to the alternatives 
having approximately equal SEU over 
a wide range of discrimination prob- 
lems and objective probabilities. 


(CONCLUSIONS 


This paper described several meth- 
ods for obtaining measures of subjec- 
tive probability from a single response. 
One method required a priori knowl- 
edge of S's utility scale and the other 
required that S never rank a set 
of alternatives inconsistently. Both 
methods were based on modifications 
of the SEU model and required that 
utility and subjective probability not 


interact. 
Two experiments were conducted 
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to test one of the assumptions basic 
to both methods: the assumption that 
people vary their preference for alter- 
natives according to subjective proba- 
bility. The results supported this 
assumption, indicating that if utility 
and subjective probability do not 
interact, estimates of subjective prob- 
ability can be obtained from a single 
response. 

Several questions remain. The 
methods must be subjected to further 
empirical test. The reliability of the 
subjective probability estimates, the 
effects of increasing the number of 
alternatives presented to S, the rela- 
tionship between subjective and ob- 
jective probability, and the validity 
of the basic assumptions of the SEU 
model are still to be determined. 
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A NOTE ON INTENSITY GENERALIZATION 
AND PROTHETIC SCALING 


SHELDON H. WHITE 


The University of Chicago 


; Tests for stimulus generalization to va- 
riation in stimulus intensity have gen- 
erally followed a typical procedure. A 
set of three, four, or five stimulus in- 
tensity values are selected. One group 
of subjects learns a response to the weak- 
est intensity, another group learns to the 
strongest, and then both groups are tested 
for generalized responding to the com- 
plete set of stimuli. 

The two-way gradients found by such 
a procedure have shown a typical asym- 
metry despite a respectable diversity of 
types of subject, stimulus dimensions, 
and learning paradigms. A description 
of the asymmetry given by Hull (1949) 
notes five points: (a) The curve of the 
group trained at the weak intensity value 
is concave downward; (b) The curve of 
the strong-trained group is concave up- 
ward; (c) The strong-trained group 
shows stronger response to its training 
Stimulus than does the weak-trained 
group; (d) The downward curve is 
Steeper in slope than the upward curve; 
(e) The upward and downward curves 
Intersect at their midpoints. 

, Explanations of the asymmetry have 
involved an energizing property of CS 
intensity (Hull, 1949), or else a dif- 
erential conditioning process involving 
Contextual stimuli (Logan, 1954; Perkins, 
1953), However, the shape which a 
Curve will appear to have depends, of 
Course, upon baseline and ordinate scal- 
ing. The purpose of the present note 


.! Study performed with the aid of a Ford 
Foundation in Behavioral Sciences research 
Brant. The writer wishes to express his 
Sratitude to Donald W. Fiske, Austen H. 
iesen, and Charles C. Spiker for helpful 
Critical readings, and also to thank Paul 


Tim for computing the asymmetry indices 
the two figures. 


is to point out that: (a) newer psycho- 
physical scales offer an alternative to 
the practice, traditional in intensity gen- 
eralization studies, of selecting intensity 
values at equal logarithmic intervals; (5) 
arbitrary logarithmic scaling can produce 
generalization asymmetry resembling the 
typical picture under discussion; and (c) 
rescaling baselines of existing curves in 
terms of magnitude-estimation scales re- 
duces the apparent asymmetry of some of 
the intensity generalization data. 


SCALING OF PROTHETIC DIMENSIONS 


Stevens (Stevens, 1957; Stevens & 
Galanter, 1957) has recently presented 
evidence for a dichotomy among stimulus 
dimensions. Those dimensions along 
which stimulus change appears to take 
place by addition or subtraction of stimu- 
lation are classed as prothetic, while 
other dimensions, for which change is to 
some extent a matter of substitution, 
are classed as metathetic. — Prothetic 
dimensions are generally associated with 
intensity attributes. The stimulus di- 
mensions along which asymmetry of gen- 
eralization has been found (brightness, 
loudness, and area) are classified as 
prothetic, while those which have showed 
approximate symmetry (hue, pitch, locus 
on skin, locus in space, length of line) 
are either metathetic or have been treated 
as though they were. That is, experi- 
mental sets of stimulus values have not 
been logarithmically spaced in the man- 
ner usually employed for prothetic di- 
mensions. 

The logarithmic spacing of a series of 
intensity values arranges them so that 
approximately equal numbers of jnd's 
fall between successive values in the 
series, assuming that the stimuli so spaced 
are thereby also arranged in even steps 
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of similarity, or psychological distance. 
The plausibility of the assumption has 
led investigators of stimulus generaliza- 
tion to space their stimuli in accord with 
the principle; Hovland (1937a) actually 
counted jnd's to locate stimuli, and later 
studies have used the logarithmic spacing 
as a good approximation of this pro- 
cedure. 

Another psychophysical estimate of ap- 
parent stimulus distances produces an 
alternative scaling. The method of mag- 
nitude estimation requires the observer 
to assign numbers to stimuli in accord 
with what he judges their intensities to 
be. Differences between numerical mag- 
nitude estimations for stimulus intensities 
are typically in better agreement with the 
actual physical energy differences be- 
tween those stimuli than the jnd curve 
might lead one to expect. That is, mag- 
nitude estimations are generally related 
to physical energy values by a power 
function, of less curvature than the 
logarithmic jnd function. Some dimen- 
sions even show a linear relationship be- 
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tween magnitude estimation and physical 
value (Stevens & Galanter, 1957). 

While logarithmic (jnd) scaling has 
been traditional and accepted, magnitude 
scaling has some points in its favor. The 
summing of jnd's to estimate psychologi- 
cal distance rests on an unproven as- 
sumption, historically controversial, that 
jnd's are equal, summable, “units” of sen- 
sation. Magnitude scaling does not ap- 
pear to require such a critical assumption. 
Further, magnitude scaling, and only this 
psychophysical technique to date, has 
Showed some amount of validity by per- 
mitting quantitative cross-modality scal- 


ing predictions (Stevens, Mack, & 
Stevens, 1960). 


Errecrs OF LOGARITHMIC SCALING 


What is the effect of baseline scaling 
on a two-way generalization curve? 
Suppose that five intensity values have 
been chosen so that the distances between 
their logarithms are equal. The line at 
the top of Figure 1 illustrates the place- 
ment of the stimuli, A-E, on the physical 


EX Se So Se 
Se So Se SS, 
Actual Spacing 


inappropriately drawn over a loga- 


ee P —€ 


rithmically scaled baseline (Graph A), and drawn ov 
B). (The training points for the weak-trained gro 
strong-trained group—filled circles, solid line, are at the left, and the curves extend right- j 
ward. The parenthesized numbers are a rough index of asymmetry, representing the 
number of square inches between the curves as originally drawn in 3" x 3" size.) 


er the correctly scaled baseline (Graph 
up—open circles, dashed line, and the 
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dimension; the traditional assumption 
would be that they mark off approxi- 
mately equal apparent steps. 

A  weak-trained group of subjects 
learns a response to S4, while a strong- 
trained group learns to Sp, following 
which both groups are tested for gen- 
eralized responding to the complete set 
of five stimuli. We will suppose that 
the generalization. function is linearly, 
symmetrically related to physical energy 
differences between training and test 
stimulus; the subject generalizes in direct 
proportion to the physical energy dif- 
ferences between stimuli. 

Hypothetical results under the above 
conditions are given in Figure 1, in both 
the A and B graphs. In the A graph, the 
Spacing of the stimuli along the abscissa 
is even, following the assumption that 
equal logarithmic distances represent 
equal apparent stimulus change. The 
generalization curves appear to be mark- 
edly asymmetrical, with that for the 
weak-trained group being concave down- 
ward and that for the strong-trained 
group being concave upward. 

In the B graph, the stimuli are spaced 
according to their physical energy dis- 
tances, and the curves for the weak- 
trained and strong-trained groups are 
Symmetrical and linear. 

It is possible that the practice of 
logarithmic spacing, characteristic of 
Intensity generalization studies, may have 
Analogously led to an observed asym- 
SNL Some support for this argument 
Vill now be sought in the existing litera- 
ture, 

" The analysis of the hypothetical ex- 

Mple just given is to serve as a model 
M. what follows. In that example, it 

as assumed that apparent stimulus dif- 
erences were in exact accord with the 
Physical energy differences between 
stimuli. For the studies to be considered, 
it will be assumed that apparent stimulus 
differences are best estimated from the 
Scales yielded by the psychophysical 
Method of magnitude estimation. 


Sims à 
TUbIES op Inrensity GENERALIZATION 


a study of bi- 


, The general format of 
has 


iron: T x Weed 
*ctional] intensity generalization 
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been described above. Some distinction 
should be made between a "pure" design 
—in which each subject is tested on only 
one stimulus for generalization—and the 
more common counterbalanced design, 
where each subject is tested on all of the 
stimuli, with different subgroups of sub- 
jects used to counterbalance the order of 
testing among stimuli. Despite the fact 
that the data from counterbalanced de- 
signs are more dificult to interpret— 
being a mixture of generalization, ex- 
tinction, retraining, and generalization 
of extinction effects—such designs have 
been used because of the considerable 
economy in number of subjects which 
they permit. 

The relevant literature 
briefly described: 

1. Hovland (1937a) used a GSR con- 
ditioning procedure, with his stimuli 
being four 1000 cycle tones separated in 


may now be 


loudness by 50 jnd steps. Separate 
groups of 16 college students were 


trained at his faintest and loudest tones, 
and then tested by a counterbalanced pro- 
cedure. 

2. Brown (1942) used rats in a 
straight alley approach situation. His 
stimuli were three illuminations of a 
large ground-glass screen at the end of 
the alley. There were two groups of 18 
rats each, tested by a counterbalanced 
procedure. 

3. Grice and Saltz (1950) also used 
rats in a straight alley approach situation. 
Their stimuli were five white circles, 
iour of them graded in area at logarith- 
mically equal intervals and the fifth at an 
intervening half-step. Testing was by a 
pure procedure, with 15 subjects for each 
test stimulus. ‘Two sets of data are to 
be considered here, the resistance to ex- 
tinction data reported originally, and 
speed data reported later (Grice, 1956). 

4. Miller and Greene (1954) used rats 
in a straight runway avoidance paradigm. 
‘The stimuli were three loudnesses of a 
There were 30 rats, tested by a 

Each data point of the 
extinction measure Wits 


buzzer. 
pure procedure. 
resistance to 
based upon five animals. 

5. Spiker (1956) tested children on a 
task in which they responded freely dur- 
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ing 3-second trial intervals. 
were four 


His stimuli 
illuminations of a circular 
ground-glass stimulus window; three of 
the stimuli were at logarithmically equal 
foot-candle interv and the fourth at 
an intervening half-step. Thirty chil- 
dren were given 12 trials of training (15 
weak-trained and 15 strong-trained), fol- 
lowed by a counterbalanced testing pro- 
cedure; another 30 were given 24 trials 
of training before test. These two train- 
ing groups will be considered separately 
below. In addition, the data of the 
pooled groups on their first generaliza- 
tion test trial will be considered, as a 
relatively pure generalization measure. 

_ An additional study (Heyman, 1957) 
is clearly relevant in design but will not 
be further considered because the data, 
as tabled, did not permit clear appraisal 
of curve shapes. 


The A columns of Figure 2 show the 
generalization curves reported for the 
above studies. To provide a clearer 
Picture of the degree of symmetry or 
asymmetry of the curves, the training 
points of both the weak-trained and 
Strong-trained groups have been placed 
9n the left, and the curves extended right- 
Ward. AIl figures have been propor- 
tioned to occupy the same square area. 
The scaling along the abscissae of the 
Column A figures is logarithmic, except 
aoe case of Hoyland, where it is in 
rms of jnd units. 

As he ordinates of the Column. B figures 

identical with those of their Column 
Coi; iinterparts ; the abscissae of the 
Ts umn B figures, however, have been 
a in the terms ot psychophysical 
eee estimations. The stimulus 

““T8y values (E) for each study were 
transformed to magnitude scale values 
(M) using the power function M=E, 
Where the exponent b is à characteristic 
of the dimension established by psycho- 
Physical investigation. For Hovland's 
Oudnesses of 1000 cycle tones, & was 0.3 
aStevens, 1956). For Brown's study, 
md Spiker's, brightness of illumination 
dd for a k of 0.36 (Stevens & Gal- 

er, 1957), Miller and Greene s buzzer 


loy 


(n xc ^ 
tiop Nesses were converted using an equa- 


n : 3 a a 
! for white noise (Stevens, 1958) with 
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a k of 0.3. Finally, Grice's dimension of 
visual area involved a & range of 0.9- 
1.15 (Stevens, 1957); a k of 1.0 was used 
here. 

If the asymmetry of the Column A 
figures were the result of baseline scaling 
alone, their treatment in Column B 
should make them roughly symmetrical. 
Several extraneous factors in the data 
being considered combine to make this a 
fairly gross specification. 

First, the psychophysical data on which 
the rescalings were based comes from 
adult human observers, but three of the 
five studies were done with rats and a 
fourth with children. The Column B 
baselines are therefore only educated 
guesses at a proper scale for those sub- 
jects. 
econd, the response strengths of the 
weak-trained and strong-trained groups 
were not always equal at their respective 
training stimuli. Generalization curves 
extending from training stimuli of dit- 
ferent height would not be superimposed, 
nor is there any good reason to expect 
them to be parallel. Connected in here 
is an ordinate scaling problem—the prob- 
ability that the same amount of response 
change has different implications starting 
from different levels of responding. 

A third factor which might prevent 
rescaling from bringing about full sym- 
metry applies particularly to counter- 
balanced designs. If the line at the top 
of Figure 1 is examined, it will be seen 
that logarithmic spacing of stimuli causes 
those of weaker intensity to be placed 
relatively closer together than stimuli of 
greater intensity. During repeated test- 
ing, the net effect of this is to produce 
less generalization of inhibition at the 
training value for the strong-trained 
group than for the weak-trained group, 
while loading relatively more generalized 
inhibition onto the tail of the strong- 
trained group. The strong-trained group 
consequently shows a steeper gradient 
than does the weak-trained group. This 
difference in steepness is not removed by 
rescaling. It might help account for dif- 
ferences between bright-trained and dim- 
trained subjects at their respective train- 
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ing values, the factor discussed in the 
preceding paragraph. o. 

Now, examining Figure 2B, it is seen 
that the rescaling meets with partial suc- 
cess in symmetrizing the curves. Five of 
the curves—the two derived from the 
Grice data, and the three derived from 
the Spiker study—appear to be brought 
closer to symmetry. These five curves 
represent only three completely inde- 
pendent sets of data among them. On 
the other hand, the rescaling of the Hov- 
land, Brown, and Miller and Greene 
studies has worked no improvement. 


Discussion or STUDIES 


Despite the fact that Hull advanced his 
five rubrics from knowledge of only the 
Brown and Hovland data, the failure to 
symmetrize either these or the Miller and 
Greene data is, to some degree, tied to 
discrepancies between these studies and 
Hull’s typical picture. Hovland's weak- 
trained group shows a generalization 
curve regularly ascending away from the 
training point; Brown's weak-trained and 
strong-trained curves show no overlap; 
the Miller curves are exactly opposite in 
curvature to the standar Only in the 
case of the Miller and Greene data—for 
which N is relatively low and reported 
response variance quite high—is there a 
clear cause to doubt the reliability of the 
reported curve shapes. Indeed, the as- 
cending trend of Hovland's weak-trained 
group was reproduced by him in two 
later sets of intensity GSR generalization 
data (Hovland, 1937b). 

On the other hand, the Grice and 
Saltz, and Spiker data, which on the 
whole provide the best complete approxi- 
mation to Hull's picture, are most altered 
by rescaling. Their residual asymmetry 
might be due to the extraneous factors 
noted above, to sampling error, or to an 
intensity effect which is only reduced in 
apparent magnitude by rescaling. 

Differentiae of studies not made sym- 
metrical by rescaling might include: (a) 
a large difference between response 
strengths of the dim-trained and bright- 
trained groups to their training stimuli ; 
(b) aversive, as opposed to appetitional, 
incentives; (c) auditory, as opposed to 


visual, stimuli; (d) counterbalanced as 
opposed to purc, designs; or (e) a stimu- 
lus set of greater "impact," whether be- 
cause of intensity or of dominance as 
figure against ground. 

What can be said, then, about the re- 
scaling venture? There is no a priori 
reason for preferring one expression or 
picturing of a set of data to another, but 
choices can be made in terms of useful- 
ness for interpretation and communica- 
tion. Here, the limited reduction of 
asymmetry by magnitude scaling may 
usefully clarify where and how much a 
special intensity factor must be invoked. 

There is still evidence for an asym- 
metry of generalization, which may re- 
flect an intensity factor, However, the 
intensity interpretation still faces as a 
difficulty the fact that a majority of the 
half-dozen conditioning studies which 
have searched for an effect of CS in- 
tensity have been unable to find it. 


SumMMary 


The typical asymmetry found in studies 
of two-way intensity generalization may 
involve baseline scaling, since all such 
studies have spaced their stimuli at equal 
jnd distances, either directly or by a 
logarithmic approximation, Magnitude 
estimations of stimuli yield a different but 
justifiable power-function spacing. An 
artificial example of a typical asymmetry 
produced by logarithmic Spacing is given. 
Rescaling of intensity generalization data 
in the literature to magnitude-scale base- 


lines results in increased Symmetry in 
some cases. 
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ALL-OR-NONE VERSUS INCREMENTAL LEARNING! 


JOAN E. JONES 


Defence Research Medical Laboratories; Toronto, 


The process of learning may be con- 
ceived of as the gradual strengthening of 
connections between events or, alter- 
natively, as the establishment of such 
connections in an all-or-none fashion. 
Estes (1960) presented results which 
suggest that evidence for the all-or- 
none position tends to be obscured by 
the traditional techniques of grouping 
data. In one of a number of miniature 
experiments cited by Estes (1960, pp. 
213-215) paired associates were re- 
called on successive test trials (Tj, Ts) 
after a single reinforced presentation 
(R;). The sequence was Rj, Ty, rest, 
Tə Estes argued that, if an incre- 
mental theory is tenable, the probability 
of a correct response appearing on T, 
should be independent of recall on Ti. 
An all-or-none theory suggests that cor- 
rect responses on Ta should occur only 
by chance unless a correct response 
was given on T; (see Figure 1). A 
close relationship was found between 
recall on T, and To, ie, of the 49% 
of cases correct on T, 75% were cor- 
rect on Ts (C-C cases) and of the 51% 
of cases incorrect on T, 91% were in- 
correct on Ty (N-N cases). 

Estes' statement of incremental theory 
forces upon it the assumptions that a 
reinforced presentation results in equal 
increments in associative strength for all 
items in all subjects and that the strength 
of competing responses is equal in all 
cases. His deductions concerning the 
probability of a correct response on To 
do not necessarily follow if these as- 
sumptions are not made. Estes contends 
that he has disposed of the point of item 
difficulty and individual differences by 
demonstrating that for his subjects and 
with his items a second reinforcement 
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(Ra) interpolated between Tı and Ta 
yields approximately the same percentage 
of newly correct responses on Ta (46%) 
as occurred on T, following R, (409% ) 
(Estes, 1960, pp. 215-216). However, 
any prediction of the relative percentages 
of newly correct responses following one 
reinforced presentation and following 
two, involves making assumptions about 
the distributions of item difficulty and 
learning ability. The character of those 
distributions for such a limited number 
of items and subjects can be known only 
empirically, Although the approximate 
equivalence of the obtained percentages 
is consistent with an all-or-none inter- 
pretation, the demonstration does not at 
all preclude the possibility that R, re- 
sults in unequal increments for different 
cases. 

According to the latter view, the asso- 
ciations have achieved unequal strengths 
after R,, hence they do not have equal 
probabilities of appearance, Subclas- 
sifying the cases into those which do 
(C) and those which do not (N) ap- 
pear on T, biases the probabilities of 
appearance on T. without further re- 
inforcement in favor of the C cases. 
Assuming that there is no forgetting 
between T, and T,, the C-N and N-C 
cases represent approximately half of 
those in which the associations ‘are at 
about threshold strength and the num- 
ber of cases in each should be approxi- 
mately equal. (The proportions will be 
equal in the special case where the C/N 
split on T, is 50/50.) Forgetting is 
reflected in an increase in the number 
of C-N cases and a decrease in the 
number of N-C cases. That is, incre- 
mental theory does not have to predict 
the frequency of N-C cases. However, 
it can predict trends in the N-C and C-N 
categories when the conditions of the 
experiment produce greater or lesser 
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degrees of forgetting. According to the 
all-or-none theory, forgetting will be 
reflected in an increase of cases in the 
C-N category but the frequency of cases 
in the N-C category will be unaffected 
unless subjects are guessing intelligently, 
in which case it will decrease. 

Another problem which precludes un- 
critical acceptance of Estes’ interpreta- 
tion is the indeterminate theoretical 
chance level. According to an all-or- 
none theory the number of N-C cases 
should be no greater than that expected 
on the basis of sheer guessing. The 
chance value depends on the population 
of responses among which the subject is 
guessing, which might be a number less 
than, equal to or greater than eight, the 
number of items presented, and cannot 
be specified other than empirically. The 
alternative forms of guessing might be 
referred to as "intelligent," “unintel- 
ligent,” and “stupid” guessing. Further 
information is necessary in order to 
evaluate whether the N-C cases which 
do occur, whatever their number, are 
sheer guesses or possible true recalls. 
This information is not available in the 
data as collected by Estes but an ex- 
tension of his procedure may clarify 
Why N-C cases do occur and also in- 
dicate whether associations have been 
established by R,, though not revealed 
on T,. 

Three experiments are reported here. 
The first is a replication of Estes' experi- 
Ment with the trial sequence Ry, Ti, 
Test" T, confirming his result. The 
Second js a variation and extension of 

Stes’ procedure, the trial sequence being 
Xi, T, To, Tz, Ta. The third is a repe- 
tition of Experiment II using a double- 
lind technique. 

EXPERIMENT Í 

Twenty-seven housewives acted as 
Subjects in Experiment I. Each of 
Sight associate-pairs, a consonant as 
Stimulus and a number as response, were 
town on separate 3” x 24” cards. Ir 
ip ap were the numbers 1 pow 5. 
Posi experimenter sat at a ae Rs 
argo the subject and presente he 

ds manually after shuffling. On R; 


Po*O 
Pos 0 
Q 

Reinforcement Test Rest Test 
Fic. 1. Schema representing effects of 
a single reinforcement according to in- 
cremental (upper branch) vs. all-or-none 
(lower branch) theories. (Squares rep- 
resent subjects with the proportion of 


darkened area in each indicating the prob- 
ability of the correct response, C, for the 
given individual) (from Estes, 1960, p. 214.) 


the cards were each held while the sub- 
ject read aloud the consonant and the 
number. On test trials cards bearing 
only the consonant stimulus were shuffled 
and then presented. The test cards were 
held until the subject responded with the 
number she though correct or with a 
guess. No omissions were allowed. Ta 
followed T, after approximately 7 min- 
utes during which time the subject took 
part in another experiment which in- 
volved sorting cards bearing lines of 
different lengths and orientations. There 
was no warning that a second test would 
be made. . 

Experiment I was not an exact repli- 
cation of Estes’. Apart from the usual 
differences of experimenter, subject, 
time, and place the present experiment 
differed from Estes’ in four respects: 
(a) In the present experiment con- 
sonants rather than —three-consonant 
syllables were used as stimuli. (b) 
Because the cards were shown manually 
the presentation and between-presenta- 
tion duration were variable whereas 
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Estes timed both of these intervals. (c) 
No omissions were allowed whereas 
Estes was forced to accept Some omis- 
sions because of the restricted intertrial 
interval. (d) The "rest" period between 
T, and Ts was longer than that allowed 
by Estes and occupied with a different 
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Fic. 2. Results of Experiment I and 
Experiment III (in parentheses). (Empir- 


ical values are proportions of quences m 
which correct, C, and incorrect, N, M ig 
on a given test trial were followec by 
C and N responses on the following test 


trial.) 
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activity. From Estes’ full report of his 
experiment (Estes, Hopkins, & Crothers, 
1960) it appears that the “rest” interval 
was brief (estimated at 114 minutes) and 
was used for a reinforced presentation 
of other items from the list. The prob- 
able unimportance of these procedural 
differences can be assessed from the 
results of Experiment I which are es- 
sentially the same as those obtained by 
Estes. Of the 53% of cases correct on 
T; 75% were correct on Tə (C-C) and 
of the 47% not correct on T, 81% were 
not correct on Ty (N-N) 


EXPERIMENT II 


The subjects for Experiment II were 
the same 27 women who served in Ex- 
periment I. Both experiments were con- 
ducted in the same Session, Experiment 
II first, and were separated by approxi- 
mately 7 minutes during which time the 
subject took part in the card-sorting 
experiment, The procedure was as de- 
Scribed for Experiment [ except that 
the test cards were reshuffled and re- 
presented immediately until four test 
trials were completed. The experimenter 
gave no indication beforehand that more 
than one test would be made, The 
stimuli were consonants, none of which 
appeared in Experiment T. The re- 
sponses were again the numbers 1 
through 8. 

The probability of recall on Ts, 
largely depends on whether the correct 
response is given on T, (see Figure 2), 
again confirming Estes' findings. How- 
ever, features of the results are difficult 
to reconcile with an all-or-none theory. 

Although the percentage of N-C cases 
(29%) is well within the range of 
chance variation, it is necessary to cx- 
plain the tendency to repeat those 
guesses on T} and Ty, ie, N-C-C and 
N-C-C-C cases. One would expect all- 
or-none theory to require the N-C-C 
and N-C-C.C values to approximate 
.29, ie, the Suessing rate established 
on T. The incremental interpretation 
demands that those values approximate 
.50, these cases being at threshold 
strength. The obtained values of .61 
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TABEE I 


Presps IN. CON asp N-C Cases as A FUxcros oF FORGETTING AND A 


Compartson oF N-C Vat 
OF INTE 


WITH THOSE 


ip ON THE Basis 


Percentage correct 


Experiment r- SERI, CN | N-C | 
= En | index Tz/Ti | Li 

|. | | | 
m 35 57 100 | —.19 | 3 | 2 
a .53 AS 25 19 ET 
Estes 49 .39 .80 .39 | .09 20 


and 47 are too high for either of these 
interpretations. These cases cannot be 
fully accounted for as part of a general 
tendency to repeat earlier calls on later 
trials, Of those who change their call 
on Ts, having been wrong on Ty, 61% 
of those correct repeat the second call 
once and 36% twice while of those 
again incorrect only 37% repeat the 
second call once and 14% twice. If 
the N-C cases are regarded as guesses it 
is necessary to explain why those calls 
are repeated with a much higher prob- 
ability than incorrect responses. If they 
are regarded as true recalls the expected 
N-C-C and N-C-C-C values are .50, or 
less (if forgetting occurs), but in fact 
the Values increase. 

„The implication is that there is some 
kind of feedback which distinguishes 
Past responses as correct or incorrect. 
One possible source of such information 
Under the conditions of Experiment lI 
55 the experimenter, who knows whether 
b response is correct or not. Experi- 

nt III was conducted to check this 
Possibility, A new group of 24 subjects 
drawn from the same population learned 
the items used in Experiment Il. One 
€xperimenter presented the paired-asso- 
Clates on R} and a second experimenter, 
ee Was unaware of the correct as- 
o sations; presented the test cards on T, 
ly. The results of Experiment II 
*Ppear in brackets in Figure 2. Again 
theigh proportion of calls correct for 
Te first time on T9 were repeated on 
i end T4,—a proportion considerably 
a of repetitions of incorrect calls. 
hose correct on Ta 53% repeat the 


response once and 40% twice, while of 


those incorrect only 32% repeat the call 
once and 12% twice. 

If the feedback does not originate 
from an unwitting experimenter then it 
must be somehow induced by the re- 
sponse itself. Bearing in mind that the 
associations in question are assumed to 
be at threshold strength and also the 
fact that recognition is easier than recall, 
it seems not unreasonable to suggest that 
in some cases the subject may recognize 
that the response she has just given as 
a guess was actually correct. Such an 
interpretation renders quite conceivable 
the notion that learning can occur on 
test trials. Estes et al. (1960) in fact 
noted such a phenomenon. However, the 
interpretation suggested does require 
that an association of liminal strength be 
set up on R, and is therefore incon- 
sistent with an all-or-none interpreta- 
tion. 

Further, comparison of the results of 
Estes’ experiment with those of Ex- 
periments II and III shows an effect 
which is also handled more readily by an 
incremental theory. If the total number 
of correct responses is taken as an index 
of retention then it is apparent that 
forgetting between T, and Tə is greatest 
in Estes’ experiment and least in Experi- 
ment III (see Table 1). The trend is 
consistent with the fact that Estes' sub- 
jects learned other associations between 
trials while the subjects in Experiment II 
sorted cards and in Experiment III there 
was no interpolated activity. From 
Table 1 it is also seen that as forgetting 
increases so the proportion of cases in 
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order of difficulty, each case being one 
d by both theoretical approaches, but also 


item for one subject. 


that the proportion in the N-C category 
decreases. A y? test based on the fre- 
quencies in the N-C categories showed 
the latter effect to be significant at the 
.001 level of confidence. The effect can- 
not be accounted for in terms of an in- 
crease in the population of responses 
among which the subject is guessing. 
The expected N-C values for intelligent 
guessing were calculated as follows. If 
C is the proportion correct on T}, c the 
probability that an item will be learned 
on R, and g the probability that the 
correct response will be guessed, then 
the equation for C (Estes et al., 1960) 
is: 


C=c+ (1- cog [1] 


Substituting the obtained value of C 
and an arbitrary value of g (with the 
limitation that g < C) into Equation 1, 
solve for c. A new estimate of g is 
obtained from: 
. 


$ 1 
* = ——— 
g n(1-— c) 


where ; is the number of items in the 
list. The new estimate of g is then 
substituted into Equation 1 and the 
procedure reiterated until a stable value 
of g is obtained. The g values obtained 
after three iterations are shown in 
Table 1. A y? test of the obtained N-C 
values against the frequencies predicted 
for intelligent guessing yielded a p of 
001, That is, the drop in the N-C fre- 
quencies as a fynction of forgetting ex- 
ceeds that expected simply as a decline 
in the chance level. 

The data of Experiment II strongly 
suggest that subclassifying the cases 
according to their history on successive 
trials does no more than sort them in 


The probability of 
appearance of a correct response on T4 
is: for cases formerly correct on all 
three trials (C-C-C cases), .90; for 
cases formerly correct on two of three 
occasions (C-C-N, C-N-C, and N-C-C), 
50, .38, and 7 respectively; correct 
on one of three occasions (C-N-N-, N- 
C-N, and N-N-C), .20, .18, and .09 
respectively; for those never before cor- 
rect (N-N-N), .05. A similar ordering 
can be observed in the data of Experi- 
ment III According to the all-or-none 
position the T, probabilities should be 
the same for all categories with the ex- 
ception of C-C.C, 

The writer appreciates Estes’ state- 
ment that he does not mean to offer the 
experiment he reports as crucial tests of 
incremental versus all-or-none theory 
and agrees with his thesis that theorists 
should examine closely the extent to 
which their concepts are artifacts of ex- 
perimental procedures. But insofar as 
the minature experiments are used as 
demonstrations, they are taken to create 
difficulties for incremental theory. Con- 
sidered in the context of additional in- 
formation from the experiments reported. 
in this paper these same data raise dif- 
ficulties for all-or-none theory. The 
criticism directed at the miniature ex- 
periments is not that they are too simple 
but that they are incomplete, 
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PSYCHOLOGICAL REVIEW 


ON THE STRUCTURE OF ASSOCIATIVE MEANING? 


JAMES DEESE 
Johns Hopkins University 


Ever since Hobbes wrote “From St. 
Andrew the mind runneth to St. Peter 
and from St. Peter to Stone,” psycholo- 
gists and philosophers concerned with 
the nature of the mind have been trying 
to describe the nature of the associa- 
tions that are so evident in human 
verbal behavior. Locke tried to picture 
the Structure of the human mind by 
describing relations between associa- 
Hons. He attributed associations to the 
More or less accidental contingencies of 
perceptual qualities in the real world. 
By so doing, Locke became responsible 
for a major tradition in the theory of 
both thought and perception. 

Almost all of the thedretical attempts 
to deal with association stem from 
COcke, For experimental psychologists 
Mterested in learning, however, the 
Most important theoretical contribu- 
tions come from the so-called secondary 
aws of association, introduced by 

homas Brown and David Hartley. 

Ssociations themselves are supposed 
to arise by contiguity, similarity, etc., 
bat they occur in the strengths and dis- 
tributions that they do because of their 
Frequency, vividness, and so on. It is 

Perimental treatment of the sec- 
°ndary principles that has led to most 
Of the stable empirical generalizations 
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about associative learning, though the 
primary "laws" have always been of 
greatest theoretical importance. 

Since Ebbinghaus, the major experi- 
mental treatment of association has 
been with artificial material, mostly 
nonsense syllables. Consequently the 
relations of contiguity, similarity, etc., 
have been arranged by experimenters, 
not obtained from the natural relations 
among linguistic elements. 

Somewhat earlier than. Ebbinghaus, 
however, Sir Francis Galton inaugu- 
rated the study of naturally occurring 
associations. For the most part, the 
tradition begun by Galton has remained 
psychometric rather than experimental. 
Recently, these two traditions have been 
combined. Nevertheless, most of the 
attempts to deal with the structure or 
organization of naturally occurring as- _ 
sociations have been empirical rather 
than theoretical, in the tradition of 
Locke. . 

Most of the empirical studies of the 
organization of naturally occurring as- 
sociations have been studies of classifi- 
cation of associations. Associations are 
classified by logical relations or by the 
dictionary meanings of words which 
occur as stimuli or as responses in free 
association tests. 

A good summary of most of the at- 
tempts to discover the structure of asso- 
ciations by classification can be found 
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in the first edition of Woodworth’s Ex- 
perimental Psychology (1938). Wood- 
worth both reviews a half century of 
German, American, and English at- 
tempts at classification and presents 
a classification scheme of his own. 
Woodworth's own classification is more 
obviously rooted in the association fre- 
quencies themselves than are the earlier 
ones. It is also unique in that it is a 
two dimensional classification. Wood- 
worth classifies both by meaning, in the 
traditional sense, and by meaningful- 
ness (in the sense used by Noble and 
others). 

As with the earlier classifications, 
Woodworth's is a combination of dic- 
tionary and grammar book rules, fil- 
tered through Woodworth's own verbal 
behavior and the verbal behavior of the 
investigators whose work he summa- 
rizes. The trouble with it and with all 
classifications to date is that they are 
not completely rooted in the association 
process itself. They attempt to impose 
upon the associations the logical rela- 
tions found in grammars and the rela- 
tions between words stated in diction- 
aries and thesaureses as well as some 
of the relations among objects per- 
ceived in the natural world. Perhaps 
the major result of such classifications 
is to convince us that while associations 
are partly related to the meanings of 
words in all of the above senses, associ- 
ations are meaningful in precisely none 
of the above senses. 

The meaningfulness of words refers 
to organized relations among the words 
and among words and objects in the 
natural world. Associations are re- 
lated to one another in some organized 
fashion and also related in some way to 
perceived properties in the natural 
world. The only conclusion is that 
there is some sense in which one may 
speak of "associative meaning and that 
the way to discover the associative 
meaning of words is not to classify as- 
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sociations according to other senses of 
meaning but to discover the relation- 
ships associations have with one an- 
other. That is the purpose of the 
present paper. 

Aside from the difficulty summarized 
above, all previous attempts at classi- 
fication, including the interesting one 
proposed by Karwoski and Berthold 
(1945), have suffered from another de- 
fect. All schemes proposed thus far 
have attempted to deal simultaneously 
with the stimuli and the responses to 
them. That is to say, they attempt to 
classify the relation between stimuli and 
responses, even though the individual 
members vary. Thus, the result fre- 
quently is an attempt to equate such 
pairs as sLow-Fast and piur-Yellow.? 
Apart from the implicit assumptions 
made by collapsing disparate stimulus 
and response terms together, the possi- 
bility is ignored that certain reciprocal 
pairs, for example, WoRK-Play, PLAY- 
Work, bear no simple consistent rela- 
tion to one another. 

For this reason, it seems more sen- 
sible to simplify the problem by dealing 
with the distribution of responses with 
stimuli held constant or with the distri- 
bution of stimuli for any given response. 
The distributions to different stimuli or 
the stimuli for pairs of responses may 
then be compared. This comparison 
should provide the first step in under- 


Standing how associations are related 
to one another, 


RELATIONSHIPS AMONG ASSOCIATES 


For the past several years, the pres- 
ent author has been studying the extent 
to which the relations among associa- 
tions determine such aspects of immedi- 
ate free recall as clustering, importations 
into recall, etc. (Deese, 19593, 1959b, 
1960). This work had its origin in 


2 In this paper, free association stimuli will 


be given in small capitals and responses in 
italics, 


tS "Áo 


On THE STRUCTURE OF ASSOCIATIVE MEANING 163 


the studies of associative clustering 
(Jenkins & Russell, 1952) and category 
clustering (Bousfield, 1953). The basic 
method has been to relate an index 
based upon the average frequency with 
which all words presented for recall 
tend to elicit one another as free as- 
sociates with the number of words 
correctly recalled, the number of im- 
portations, etc. This index is a power- 
ful predictor of a number of character- 
istics of free recall. Free recall itself 
is a representative verbal process, and 
thus it is evident that the associative in- 
terrelations among words are powerful 
determiners of the pattern and fre- 
quency with which words are emitted. 
^ The interword associative index is 
computed by tabulating the frequencies 
with which each word in a particular 
collection (usually a list of words to be 
presented for recall) occurs as a re- 
sponse to all of the other words as free 
association stimuli. Casual inspection 
of the matrix that provides the basis for 
such a tabulation gives striking con- 
firmation of what every investigator 
who has studied free association has 
felt to be the case, namely that associa- 
tions exist in well organized and in 
some instances tightly organized net- 
works. 'The networks are so apparent 
that they turn up even in random col- 
lections of words, if the words are high 
in frequency of usage (Deese, 1960). 

The networks that appear upon such 
a tabulation provide only part of the 
Picture, however, because the words 
that appear as responses are artificially 
restricted to words which appear as 
stimuli, When examined, the original 
association frequencies show that stim- 
uli which do not elicit one another as 
responses sometimes have in common 
a great many responses. An example, 
picked more or less at random, is the 
Pair PIANO and SYMPHONY. Ina sam- 
Ple of Johns Hopkins University under- 
Sraduates, neither one elicited the other, 


but both elicited Note, Song, Sound, 
Noise, Music, and Orchestra in varying 
frequencies. 

Thus, the relationship between free 
association. stimuli should be deter- 
mined from all of the responses that 
stimuli have in common. If the asso- 
ciative meaning of any stimulus is given 
by the distribution of responses to that 
stimulus, then two stimuli may be said 
to have the same associative meaning 
when the distribution of associates to 
them is identical. Two stimuli overlap 
or resemble one another in associative 
meaning to the extent that they have 
the same distribution of associates. 

One may also deal with the associa- 
tive distribution ? of the responses them- 
selves. The associative distribution of 
a response is the collection of stimuli to 
which it occurs. This relation is less 
interesting than that between responses 
to stimuli for a number of reasons. It 
does not, of course, specify an associa- 
tive concept as does the distribution of 
responses to a stimulus, and it is sub- 
ject to serious sampling limitations. 
One is fairly sure to pick up any rela- 
tions of statistical significance among 
stimuli by a modest sample of re- 
sponses. With the distribution of stim- 
uli problem, however, there is no way 
to insure that one has sampled all stim- 
uli that are high frequency producers of 
a particular response short of exhaust- 
ing the language. Therefore, except 
insofar as the distribution of stimuli 
problem is related to the problem of 
associative meaning, it will be ignored 
in the balance of this paper. 


SPECIFICATION OF RELATIONSHIPS 
= 


The idea of associative meaning pre- 
sented above implies that one describes 


3 The word “distribution” is used here in 
analogy to its linguistic, not its statistical 
sense. That is to say it indicates the number 
of situations to which a particular word 
Occurs as a response. 
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and classifies associations purely in 
terms of the relations among the re- 
sponses to different stimuli. There are 
several ways in which such relations 
may be specified. One may count the 
number of associations in common in 
any fixed size sample, or one may 
weigh the common associations by their 
frequency. For prediction of effects in 
verbal behavior, the latter procedure is 
probably more precise, and it is the ap- 
proach taken in the present paper. 
Also, one may wish to describe the 
ways in which stimuli resemble one 
another in associative meaning. This 
implies the study of a collection of re- 
lations between a number of stimuli. 
Factor analysis suggests itself as a way 
of reducing the structure within such a 
collection to manageable proportions. 
In any event, the heuristic value of the 
concept of associative meaning will de- 
pend upon ways of specifying the rela- 
tions between free association stimuli. 
The purpose of this section is to pro- 
vide a basic method for measuring the 
relationship. 

There are several difficulties that 
stand between the definition of associa- 
tive meaning and the specification of 
particular associative meanings by a 
study of the relationships among free 
associations. "The first of those is cre- 
ated by the technique of collecting free 
associations, which almost completely 
eliminates the occurrence of the stimu- 
lus word as a response to itself. This 
problem must be dealt with first, 

The first step in an analysis of asso- 
ciations is to tabulate for each stimulus 
word in a collection the responses 
which the word has in common with 
any other stimulus words in the collec- 
tion. Since, by instructions as well as 
by technique a word usually does not 
occur as a response to itself in free as- 
sociation, the cells in such a tabulation 

in which the same word. as response 
and stimulus intersect will be empty. 
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This creates intuitive difficulties if one 
defines the relation of associative mean- 
ing between words as the common dis- 
tribution of responses, because pairs of 
words often appear to be intuitively re- 
lated when they have no overt responses 
in common. Upon close examination 
all such intuitive cases turn out to be 
cases in which the responses to one 
word are the stimuli to the other. An 
example in the Hopkins data is the 
SOFT-LoUD pair. Both of these words 
elicit one another, but they have no 
other responses in common. Thus, 
they could have no associative meaning 
in common if the possibility of a word 
occurring as a response to itself is 
denied. 

Fortunately, a way out of this para- 
doxical situation already exists in a 
technique independently designed for 
other purposes by Jenkins and Cofer 
(1957) and by Bousfield, Cohen, and 
Whitmarsh (1958), The technique 
consists of assuming that a word elicits 
itself implicitly as an associative 100% 
of the time. This assumed implicit re- 
sponse is called by Bousfield, Cohen, 
and Whitmarsh the “representational” 
response. The implication of the name 
is that this response identifies or repre- 
sents the symbolic stimulus presented 
to the individual. 

Both Jenkins and Cofer (who do not 
explicitly identify the representational 
response) and Bousfield, Cohen, and 
Whitmarsh implicitly make the assump- 
tion that the representational response 
is always the same word as the stimulus 
presented. This assumption is very 
nearly correct for words of common 
cultural use, but it must surely be 
incorrect for unusual words. Many 
Hopkins undergraduates, for example, 
respond to the words AnBEss and ABYSS 
in precisely the same way, which sug- 
gests that the representational response 
for one of them is not the dictionary 
equivalent of the word. Nevertheless, 


ON THE STRUCTURE OF ASSOCIATIVE MEANING 165 


this assumption works with only trivial 
exceptions for moderately sophisticated 
individuals and words of ordinary íre- 
quency of usage. It is, therefore, 
adopted here. 

With this assumption it is possible to 
prepare a table like Table 1. This table 
shows the frequency of responses com- 
mon to any two stimuli, with the fre- 
quency of the stimulus word as a 
response to itself entered with 100% 
frequency. Since the N for this table 
is 50, the value entered in the stimulus- 
response intersection cells is 50. Note 
that it is not necessary for all the 
stimulus-response intersections to ap- 
pear. If a particular stimulus word 
never appears as a response to the other 
stimuli, it will not appear as a common 
response. Such instances are rare, 
however, when the collection of stimuli 
1s made on some basis that predicts re- 
lated associative meaning. 

The second problem is to find an 
appropriate measure of the relation be- 
tween the responses common to differ- 
ent stimuli. This is the problem of 
finding a measure of the relations be- 
tween the columns in Table 1. The 
most obvious solution is to use product- 
moment correlations as the measure of 
relations between columns. There are, 
however, a number of objections to 
Such a solution. 

The most important objection, de- 
termined in part by the fact that the 
technique of free association limits 
each subject to one response, is that 
the product-moment correlation would 
yield a measure of the correlation of 
distributions of associations rather than 
the extent to which frequencies are in 
common. Often, responses that are 
high in frequency to one stimulus will 
be low in frequency to another. This 
happens very often because of the 
typically steep rank-frequency distribu- 
tion of free association responses to any 
Siven stimulus. Because the forms of 


the distributions of common responses 
may be inverse to one another, the ob- 
tained product-moment correlations of 
the frequencies will often be negative. 
Thus, a table of product-moment corre- 
lations obtained from Table 1 would 
yield a few zero correlations, a few high 
positive correlations and a large num- 
ber of low negative correlations. 

Since it is not the relation between 
the forms of the distributions of re- 
sponses but the relations between the 
frequencies themselves that is inter- 
esting, a direct measure of overlapping 
frequency is appropriate. To measure 
the commonness of associative meaning, 
we may take the ratio of the sum of the 
overlapping frequencies to any pair of 
stimuli to the maximum possible sum. 
Thus, if we turn to the first two col- 
umns in Table 1, which are the responses 
to morm and INsEcT respectively, 
we see that the response frequencies in 
common sum to 12. The total possible 
frequency in common is 100; this yields 
a relative common frequency of .12. 
This number is entered into the ap- 
propriate cell in Table 2. 

A simple interpretation of such a 
measure demands some assumptions 
which are not strictly correct. In other 
words, for most problems, this measure 
can be regarded as an approximation. 
It supposes that the distribution of re- 
sponses taken one per individual per 
stimulus word from a homogenous 
population is characteristic of the dis- 
tribution of associations at different 
times in any one representative indi- 
vidual from that population. Thus, the 
assumption is made that the distribution 
of responses obtained from a sample of 
individuals is the same as the dis- 
tribution of associations in any one 
individual. 

This is an assumption shared by most 
students of free associations in popula- 
tion samples (and one only occasionally 
explicitly stated). An experimental 
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comparison of repeated testing vs. counting instances of commonness of 
single testing is a badly needed addition association. " 

to the association literature, and it may This measure of the relation in asso- 
well lead to a more satisfactory way of ciative meaning between two words can 


TABLE 1 


Raw FREQUENCIES OF ASSOCIATES IN COMMON TO 19 Worps 
BasED ON RESPONSES oF 50 SUBJECTS 


Stimulus words% 
Responses 


1 2|3|4|5|6|7/|8]|9 ]10] 11] 12} 13] 14 15 | 16 | 17 | 18 | 19 


Moth 50 

Insect 1 (50 3 
Wings 2 50 

Bird 25 |5 
Fly 10 | 9 |12 |1 
Yellow 50 | 2 
Flower 2 [50 
Bug 24 4 50| 5 1 
Cocoon 50 
10 Color 
11 Blue ifi 
12 Bees 1 212 
13 Summer 2 
14 Sunshine ‘ , ik 3 à 12 
15 Garden 6 50 

16 Sky 1 6 1 EO 
17 Nature 

18 Spring 3 50 * 
19 Butterfly 0 
20 Light d 1 " a 50 
21 Pretty 3 1 
22 Ant 3 1 
23 Bright 1 4 
24 Airplane 1 

25 Feather 
26 Flight 
27 Tree i " 
28 Plane g 5 

29 Red 
30 White 
31 Green 
32 Sun 
33 Beetle 1 1 
34 Spider 1 1 
35 Gold 1 
36 Black 1 1 8|2 
37 Winter 
38 Warm 
39 Plant 2 
40 Gray 1 112 
41 Brown 1 1 
42 Vacation 2 1 


0-0 U0ot-^ 
ac 
e 
N 
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t2 
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a The numbers of the stimulus words correspond to the first 19 response words. 
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TABLE 2 


OVERLAP COEFFICIENTS FoR COMMON ASSOCIATES BETWE 
(Decimals Omitted) 


THE 19 Wonps IN TABLE 1 


Stimulus 


Stimulus words 


words 


100 | 12 | 12 | 12 | 11 | 02 | 00 | 05 
100 | 09 | 09 
100 | 44 | 19 | 00 | 00 | 03 


d FLOWER 100 | 02 
BUG 100 
9 cocoon 
10 coLor 

11 BLUR 

12 BEES 

13 SUMMER 
14 SUNSHINE 
15 GARDEN 
16 sky 

17 NATURE 
18 sPRING 
19 BUTTERFLY 


vary from .00 to 1.00. The latter value 
would happen, however, only if the two 
Stimuli produce completely reciprocal 
responding. Thus, complete identity in 
associative meaning can occur by this 
measure only if two stimulus words 
always elicit each other and no other 
Words. This is a consequence of the 
assumption of the frequency of the rep- 
Tesentational response. Thus, if a pair 
of stimuli had distributions of responses 
exactly alike except that they never 
elicited each other, the maximum rela- 
tive frequency in common would be .5. 
Such a result appears at first to be a 
Very undesirable property of the meas- 
ure of relation in associative meaning. 
However, it turns out, as will be 
Pointed out later, that this property is 
extremely important in generating some 
Useful consequences of the concept of 
associative meaning. 

The relative overlap coefficients for 
all pairs of stimuli within a set produces 
à matrix of the sort found in Table 2. 
The only problem remaining is to fill 
in the principal diagonal, or the com- 
Munalities in an ordinary table of corre- 
ations, In keeping with the assump- 
tions made above about the assessment 


of associative meaning, the proper value 
to enter into these cells is 1.0, since any 
word always has the same distribution 
of response frequencies as itself. This 
is like the assumption introduced by 
Osgood and Suci (1952) into their 
factorial portrayal of the semantic 
differential. 


ANALYSIS OF ASSOCIATIVE MEANING 


Before examining some of the impli- 
cations of this notion of associative 
meaning, it would be helpful to look at 
some sample data showing some of the 
relations that can exist between the 
associative meanings of different words. 

It would be possible, of course, to 
apply the present analysis to random 
collections of words, but because the 
overlap in associative meaning between 
words taken at random is not high, 
such an application would not be re- 
warding at this stage. In general, an 
analysis of associative meaning would 
be conducted on a set of stimulus words 
selected because we would be interested 
in discovering their relationships and 
because we would suspect, from prior 
information, that they were associa- 
tively related. 
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The data presented here illustrating 
some of the possible relations in asso- 
ciative meaning are not psychologically 
interesting, with one exception. They 
were, however, selected in such a way 
as to insure some degree of related 
associative meaning; thus they illus- 
trate fairly well some of the analytic 
possibilities. 

The sets of stimulus words in Tables 
3, 4, and 5 have internal structure for 
the simple reason that the stimulus 
words themselves are all responses to 
one of the stimulus words by the 
Minnesota norms (Russell & Jenkins, 
1954). For example, the words in 
Table 3 include BUTTERFLY and words 
which are responses to BUTTERFLY in 
the Minnesota norms. 

These data are based upon samples 
of 50 subjects each, which are a bare 
minimum for any kind of reliable an- 
alysis and too small for anything but 
the roughest kind of prediction. They 
serve, however, to illustrate how sets of 
words are related to one another and 
what the factor structure of associations 
is like. The subjects from which these 
associations were obtained are under- 
graduate students (male) at the Johns 
Hopkins University, 

The basic data from which the fac- 
tors in Table 3 are derived are in Table 
2. These are relative common frequen- 
cies of responses to 19 words in the 
BUTTERFLY set. These relative fre- 
quencies were factored by the centroid 
method, and after a very few rotations 
(pairs of factors were rotated through 
approximately 45 degrees) the factor 
loadings presented in Table 3 were ob- 
tained. The factor loadings in Table 4, 
based upon stimulus words obtained 
from music, and Table 5, based upon 
stimulus words obtained from SLOW 

were likewise arrived at by centroid 
analysis and very simple rotation. 

Since the major emphasis in these 


James DEESE 


sample data is on the factor presenta- 
tion, a word needs to be said about 
factor analysis in this context. Factors, 
of course, depend upon the variables 
used to obtain them. In testing theory 
this is a sometimes overlooked restric- 
tion if for no other reason than that test 
theorists are often convinced that the 
batteries they factor are in some sense 
“representative” of all the set of abili- 
ties they wish to factor. In the analysis 
of words, however, this is not the case, 
So we are forced to an awareness of the 
limitations of factor extraction. The 
words that go into any matrix are only 
a small part of those that might possibly 
have been used. Thus, the results of 
the factor analysis are properties of the 
limited set, and we cannot talk about 
the factor loadings for any one word, 
for example, as if these exhausted all of 
the associative structure of that word. 
As it turns out, this is a very useful 
property of the analysis of associative 
relations among words, but it tends to 
make trivial collections of words, such 
as those in Tables 3, 4, and 5 less inter- 
esting in a factor analysis than would 
be collections of words with some 
Suspected important structure to be 
uncovered, 

The first thing to notice about the 
factor loadings is the convenient simple 
Structure in the factors. In Table 3, 
approximately half of the words have 
nearly zero loadings on Factor I, while 
the other half of the words have posi- 
tive loadings of about the same magni- 
tude. Those words on Factor I that 
have approximately zero loadings have 
positive loadings on Factor II, and the 
words with positive loadings on I have 
nearly zero loadings on II. Thus, the 
first two factors effectively sort the 
stimulus words into two classes, 

This kind of initial classification is a 
property of the kind of distribution of 
common frequencies found in Table 2. 


ON THE STRUCTURE OF ASSOCIATIVE MEANING 169 


and since this distribution is typical of 
all of the 15 or so matrices thus far 
examined by the author, it will probably 
occur in all centroid analyses of words. 
The same initial classification occurs 
in Tables 4 and 5, and it also occurs in 
Table 6 for which the stimulus words 
were not selected for associative rela- 
tions but because they centered about a 
particular attitude or value system. 

An inspection of the loadings of Fac- 
tor I in Table 3 shows that it occurs in 
words having to do with animate crea- 
tion (BEES, FLY, BUG, WING, BIRD, etc.) 
whereas the words with loadings on 
Factor II are those words in the set 
that do not have to do with animate 
things (sky, YELLOW, NATURE, etc.). 
It would be a mistake, however, to fall 
into the practice of naming factors de- 
rived from associative distributions, for 
inevitably we fall into the attempt to 


TABLE 3 
Roratep CENTROID FACTOR LOADINGS OF 
STIMULUS OVERLAP COEFFICIENTS 
PRESENTED IN TABLE 2 
(Decimals Omitted) 


Factors 
Words 

I om II IV V VI 
Moth 44 03 —27 —01 —03 —32 
Insect 50 01 —33 01 —34 11 
Wing 52 01 45 OL 29 —07 
Bird 52 02 46 O1 29 —07 
Fly 48 03 32 (01 —28 —03 
Yellow 01 44 —03 34 —32 -02 
Flower 01 39 —03 —32 03 44 
Bug 41 01 —341 00 -14 37 
Cocoon 40 01 —35 00 25 02 
Color —02 42 —04 44 04 —04 
Blue —02 57 —04 52 23 —04 
Bees 36 04 34 —02 —30 00 
Summer —01 31 —03 —34 —02 —34 
Sunshine 02 37 —04 —33 —03 -—35 
Garden 00 35 —02 —34 —03 44 
Sky —01 41 —03 43 38 —07 
Nature 04 31 29 —02 01 34 
Spring —01 35 —03 —37 —02 —36 
Butterfly | 48 06 —29 —01 26 01 


impose a logical structure on the asso- 
ciations, and one thing that is clear is 
that associative meaning is not logical. 
Nevertheless, it is clear that the first 
two factors have split the stimulus 
words into the most obvious and basic 
structure organizing them. 

Factor III in Table 3 shows zero 
loadings on the nonanimate words 
again (except for NATURE, which ap- 
parently comes close to straddling the 
basic division of words in Table 3), 
and it splits the animate words into a 
bipolar factor. The positive loadings 
are on WING, BIRDS, BEES, FLY and the 
negative loadings on BUG, COCOON, 
MOTH, BUTTERFLY (which probably oc- 
curs here because of its strong relation 
to BUG and cocoon). Factor IV makes 
a bipolar split of the nonanimate words. 
SUMMER, SUNSHINE, GARDEN, FLOWER, 
and SPRING go together while BLUE, 
SKY, YELLOW, and COLOR go together. 

It is instructive to follow the factor 
profiles for pairs of words within a set. 
This allows us one of several ways of 
preparing an associative thesaurus, in 
this case a pair thesaurus. Thus, we 
may say that BLUE and vELLOW are 
both alike within this set in that they 
share common loadings with sky, SUM- 
MER, SUNSHINE, COLOR, etc. On one 
factor, however, they diverge; this pro- 
vides the critical difference between 
BLUE and yELLow within this set of 
words. BLUE goes with SKY, BUTTER- 
FLY, WING, and BIRD, while YELLOW 
goes with INSECT, FLY, BUG, and BEES. 
This, of course, does not exhaust the 
contrasts in associative meaning be- 
tween BLUE and YELLOW, but it does 
suggest one way in which they differ. 

Table 4 presents the first six factors 
from a centroid analysis of the matrix 
of stimulus words which are themselves 
responses to Music. Again, the first 
two factors yield a basic split. Factor I 
has loadings on TONE, INSTRUMENT, 
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TABLE 4 


ROTATED CENTROID Factor LoADINGS OF 
STIMULUS OVERLAP COEFFICIENTS FOR 
17 Worps DERIVED FROM MUSIC 
(Decimals Omitted) 


Factors 
Words 

I I Ill IV X VI 
Tone 19 43 —03 40 02 53 
Instrument|43 —01 33 01 —31 01 
Symphony |49 06 34 01 43 00 
Sing 41 —01 —53 02 —36 —01 
Note 43 01 —33 03 37 —01 
Song 55 —01 —47 00 —31 01 
Sound 04 39 00 —26 41 07 
Piano 52 02 37 01 —24 00 
Noise 06 59 01 06 02 31 
Band 51 08 38 —04 —24 -—03 
Horn 37 —09 00 —32 00 —43 
Loud 00 48 —01 —42 —01 —29 
Hear 06 60 —01 49 01 —18 
Opera 51 03 —33 01 32 00 
Ear 00 41 —01 52 —02 —36 
Music 58 08 25 01 37 02 
Soft 07 23 —02 —01 00 07 


SYMPHONY, SING, NOTE, SONG, PIANO, 
BAND, HORN, OPERA, and MUSIC, while 
Factor II has loadings on sounD, 
NOISE, LOUD, EAR, and HEAR (this is a 
less clean split than the previous table 
because of the small loading of TONE 
on Factor I). As with the previous 
table, Factors III through VI are bi- 
polar. Again, interesting contrasts may 
be seen in factor profiles. OPERA and 
SYMPHONY, for example, have parallel 
profiles except on Factor III where they 
are bipolar opposites. Here, OPERA 
goes with SONG, NOTE, and SING, while 
SYMPHONY goes with INSTRUMENT, 
PIANO, BAND, and MUSIC. 

Table 5 exhibits exactly the same sort 
of pattern with the words centering 
around stow and so will not be 
commented on further. This table 
illustrates an effect to be found in asso- 
ciations in adults, the close relations of 
opposites. The pair SLOW-FAST yield 


an overlap index of 71 for the Hopkins 
population, the highest for any of the 
words in Tables 3, 4, and 5. For the 
six factors extracted, the factor profiles 
for these two words are practically 
identical. This means that there is not 
enough contrast provided within the 
matrix of words to separate these words 
within the arbitrary limit of six factors. 
This is in contrast with the pair sorr- 
LOUD, which separates almost immedi- 
ately in the music table. The presence 
of NOISE, HEAR, HORN, etc., managed to 
pull sorr and Loup apart by providing 
Some associative meaning overlapping 
with Loup but not with sor. 

Because of the close relation be- 
tween stow and rasr, it may be very 
difficult to provide sets which will con- 
trast these in associative meaning. They 
are, as are most of the high frequency 
opposites, adjectives, and they are de- 
scriptive of a common state modified 
only in a kind of secondary way, That 


TABLE 5 


ROTATED CENTROID LOADING or STIMULUS 
OVERLAP COEFFICIENTS OF 16 Wonps 
DERIVED FROM SLOW 
(Decimals Omitted) 


Factors 
Words 
I II HI IV v VI 
Slow 67 00 39 —01 20 —01 


Walk 41 02 —S5 00 —38 03 
Speed 19 09 —33 07 47 -04 
Quick 61 00 25 —02 —07 —28 


Lazy 04 35 05 —38 —05 —36 
Drive 10 39 —33 02 00 04 
Skid 05 40 —01 36 35 —01 
Run 54 01 —52 —01 31 —01 
Work 03 33 36 —01 05 —36 
Fast 733 01 38 —02 20 —02 
Down 01 31 —01 —36 —07 —37 
Stop 001 37 05 37 —09 36 
Snail 43 02 37 02 —10 33 
Sign 00 36 07 07 02 —44 
Poke 02 34 05 —37 —05 35 


Traffic 04 36 06 33 —04 
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is to say, both words imply motion; 
they both describe the rapidity of mo- 
tion, and they contrast only in the kind 
of rapidity. The way in which sort 
and Loup are alike, however, provides 
only part of the distribution for sort, 
since it exists also in a close relation 
with the word HARD. HARD and Loup 
are related through sort but otherwise 
independent. 

Table 6 illustrates the factor analysis 
of a set of stimulus words which are 
psychologically more interesting. These 
words are taken from a larger study, to 
be presented elsewhere, which is con- 
cerned with the factor structure of asso- 
Ciative meaning of critical words for 
subjects scoring high and subjects scor- 
ing low in each of the value scales from 
the Allport, Vernon, and Lindzey 
(1960) study of values inventory. The 
critical words for each value scale were 
obtained from the actual items scored 
for that value scale; thus the words in 
Table 6 are from the Religious Value 
Items. 


TABLE 6 


ROTATED CrNTROID LOADINGS OF STIMULUS 
OVERLAP COEFFICIENTS FOR 15 WORDS 
CENTERING IN RELIGIOUS VALUE 


Factors 
Words 

I II II IV v VI 
Sermon 03 43 38 —10 04 33 
Clergyman | 11 42 05 —47 00 26 
Religion 54 03 —02 —34 —03 —36 
Service —01 40 —02 —43 37 —16 
Worship 57 04 33 —05 —30 —02 
Reverence 55 04 36 02 27 —03 
rayer 54 03 34 —03 —01 —34 
Soul 06 38 —02 34 03 39 
Spirit 39 00 —37 05 39 05 
Divine so 00 05 35 34 04 
Faith 52 08 —36 —04 —35 01 
Inspire 03 40 —01 33 —01 —41 
Devotion 49 10 —33 02 00 34 
ove 03 38 00 35 36 05 
Hope 06 37 —41 —01 —04 —45 
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Completed data are available at pres- 
ent only on high scorers (the distribu- 
tion of scores in the Hopkins population 
is skewed). Thus, the factors in Table 
6 are based upon 50 Hopkins under- 
graduates who scored at or above one 
probable error in the published norms 
for men on religious value. The first 
thing to notice about Table 6 is that it 
is very much the same as Tables 3, 4, 
and 5. The initial classification in this 
case sorts the words into a group in- 
cluding DIVINE, SPIRIT, PRAYER, WOR- 
SHIP, REVERENCE, RELIGION, FAITH, 
and pEVOTION and into a group includ- 
ing HOPE, SOUL, LOVE, INSPIRE, SERVICE, 
SERMON, and CLERGYMAN. One is al- 
most tempted to say that the classifica- 
tion divides the words into an institu- 
tional, theological class and a human, 
individual class. 


IMPLICATIONS OF ASSOCIATIVE 
MEANING 


By now, some readers will have 
recognized the notions presented here 
as extensions of those stated by Bous- 
field, Cohen, and Whitmarsh (1958), 
and since these authors regard their 
work as an extension of Noble's 
(1952), by implication the present 
work is also. 

Thus far we have asserted that the 
distribution of responses to any free 
association stimulus * forms the associ- 
ative meaning of the stimulus word and 
thus an associative concept named by 
that word. If the associative meaning 
of a word (as opposed to any particu- 
lar stimulus-response pair) is to have 
any use, it must predict something 
about verbal behavior. In this section 


4 Free association stimuli are not limited 
to words, of course, and the ideas presented 
here can be in theory, extended to nonverbal 
stimuli, but the lack of a clear criterion for 
the representational response may produce 
difficulties in practice. 
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we will discuss some of the character- 
istics of verbal behavior that may be 
predictable from the associative. mean- 
ings of words and from collections of 
associative meanings. 

Associative meaning will not predict 
the tendency of words to elicit one an- 
other; that is predicted by the free as- 
sociation frequencies of particular pairs 
or by some general measure of pair re- 
lation (such as the interword associa- 
tive index). Associative meaning, in 
general, should predict the words that 
will occur in the verbal environment of 
a particular word. This statement en- 
tails the assumption that words are 
used, subject to certain other con- 
straints, in particular environments be- 
cause of the distribution of associations 
they possess. Thus, the distribution of 
associates takes on the character of 
mediation determining the use of par- 
ticular words. 

Thus, if a particular word appears in 
some verbal environment, a word close 
to it in associative meaning should ap- 
pear in the same environment. Gener- 
ally, the closer the relation, the higher 
the probability of the two words appear- 
ing in the same environment. The 
words may appear in the same environ- 
ment in two ways: (a) as substitutes for 
one another, or (b) as part of one an- 
other's environment. It is nota part of 
associative meaning to predict which of 
these two ways will appear; though 
such a prediction should arise out of 
theory about the origins of associations. 

The first kind of environmental rela- 
tion (that of substitution) is illustrated 
quite clearly by the highly related 
SLOW-FAST pair. The environment that 
produces Slow, all other things equal, 
is very likely to produce Fast also, but 

not nearly so likely to produce diction- 
ary or thesaurus equivalents. of Fast. 
Thus, we are likely to say tis. ds a 
slow train" or "this is a fast train," but 


we are unlikely to say "this is a rapid 
train" or "this is a sluggish train." 

The second kind of environmental 
relation (that of coordination) is illus- 
trated by the sky-BLUE pair. Given the 
frame, “he into the 
sky,” a highly likely substitution in the 
second blank is “blue.” All words in 
English seem to be able to produce both 
kinds of environmental relation, but for 
many words one or the other type 
predominates. Thus, while “the blue” 
may appear as a substitution for “sky,” 
such useage is poetic and rare. 

One problem is produced by the ap- 
parent assymetry of substitution in par- 
ticular frames. It is possible, indeed 
probable, that the frame, “he 
into the blue — ——," will not pro- 
duce "sky" with the Same frequency 
that "blue" is produced in its comple- 
mentary frame, Particular test frames 
have an influence, and it is possible 
that substitution frequencies may be 
Symmetrical when averaged over a rep- 
resentative group of test frames. Such 
Symmetry isa Property of the definition 
of associative meaning, though the 
property cannot be properly tested, 
since any failure of Symmetry may 
easily be attributed to a lack of ade- 
quate sampling. 

Earlier it was pointed out that one of 
the consequences of the assumption of 
the representational response was that 
perfect agreement in associative mean- 
ing between two words would occur 
only if they were perfectly reciprocal, 
that Is to say, if they were the only 
associates of each other, While this 
seemed at first to be an undesirable 
Property, it does conform with the 
limiting case of substitution. For, if 
two words are perfectly correlated in 
associative meaning, then they ought to 
be the only substitutes for one another 
in the substitution relation. Notice 
that this situation would not theoreti- 
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cally prevent these words from being 
part of the associative meaning of a 
third word. Thus Word A could elicit 
only B as a free associate and B elicit 
A, which would make them overlap 
completely in associative meaning, but 
both A and B could be free association 
responses to a third Word C. Obvi- 
ously this situation would not happen 
m any real language, but an approxi- 
mation to it frequently occurs. The 
general rule for such a case is that the 
reciprocal pair of words are both of 
high frequency of usage, while the third 
word, which elicits them both but never 
Occurs as a response to either, is a 
word of very low frequency of usage. 
An example in the Hopkins data is 
the overlapped pair WOMAN and GIRL, 
neither of which elicit Virginal, but 
both of which are elicited as responses 
by VIRGINAL. 

The use of test frames to validate the 
notion of associative meaning will be 
difficult in practice, since the influence 
of grammatical constraints, etc., needs 
to be removed. Furthermore, we do 
not know enough about the problem of 
Compounding to know whether or not 
compound effects can be derived from 
the concept of associative meaning it- 
self. Nevertheless, whatever other in- 
teresting effects may be achieved, the 
ultimate validation of the concept of 
associative meaning must come from its 
ability to predict substitutions in the 
flow of language. 

The most important use of the notion 
Of associative meaning is to study the 
Structure of sets of words having intrin- 
Sic psychological interest. The number 
Of such sets is very large, and indeed is 
only limited by the ideas psychologists 
have about the influence of psychologi- 
Cal variables upon the use of words. 

An obvious application is to any col- 
*ction of words as metaphorical sym- 

ls. Those words (or concepts) 
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which by psychoanalytic theory, for 
example, are supposed to symbolize the 
phallus, the female figure, hostility, etc., 
ought to have their common symbolic 
character revealed in the interrelations 
and factor structure of their free asso- 
ciates. This is not to say that phallic 
symbol words ought to reveal a com- 
mon core of phallic meaning (by dic- 
tionary criteria) but that they ought to 
converge on some common factor. Ina 
word, no matter how remote their dic- 
tionary meanings and references in the 
real world, they should reveal a com- 
mon associative meaning. If they do 
not, it is hard to see how the particu- 
lar psychoanalytic symbol investigated 
could have any general validity for the 
population tested. 

Currently, the present author is 
studying the structure of associative 
meaning within a set of words that 
were selected from Rorschach protocols 
as indicators of aggression. No de- 
tailed account of these data can be 
given at present, but a striking qualita- 
tive aspect of them is the extent to 
which the overlap in associative mean- 
ing among them is mediated by a small 
number of response words, many of 
which appear out of context to be rela- 
tively neutral (Stick, for example). 

The technique may be used also to 
study the verbal organization associated 
with attitudes. Table 6 presents the 
factor structure of associations to words 
centering about religious values for in- 
dividuals who score high in religious 
value. A complete set of data is not 
yet available for low scorers, but it is 
almost certain that the most important 
difference between the high and low 
scorers will be in the extent to which 
the associations to these words are well 
organized. The factor loadings for low 
scorers will almost certainly be lower; 
indeed, many of the stimulus words 
(SERVICE, LOVE, HOPE, SPIRIT) will all 
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but disappear from the factor structure, 
and it is very likely that the factor 
structure of the remaining words will 
be very different. 


ORIGIN OF ASSOCIATIONS AND 
SUMMARY 


The last question we face concerns 
the origin of associations. The view 
of Locke that attributed them to the 
accidental contingencies of nature has 
led to a kind of mosaic theory of mind, 
a theory that has scarcely ever had a 
serious rival on its own ground, though 
it has often been challenged. 

No real substitute for the contiguity- 
similarity postulate is offered here. The 
highly organized economy of associa- 
tive meaning has impressed the author, 
however, and it was a belief that the 
human mind derived associations from 
categories of its own that sent him on 
the search for a technique by which to 
study associative meaning. Thus, the 
least that can be offered is the sug- 
gestion that associations derive in whole 
or part from the structures or cate- 
gories of the human mind. This belief 
is probably the mover of attempts, such 
as Woodworth's, to classify associa- 
tions. Such attempts are fruitless, 
however, for if there are categories 
in association, they are categories of 
association, not categories of subordi- 
nation, coordination, etc. This is ad- 
mittedly the belief that motivated the 
present work. — 

Thus, the implicit distributions of re- 
sponses that define associative meaning 
may exhibit patterns of overlap because 
they are derived from simple structures. 
An adult may use the word BUTTERFLY 
to correspond to a class of living crea- 
tures that fly and are bugs. This 
doesn't however, exhaust the associa- 
tive meaning of BUTTERFLY, for m 
flying bugs are dirty, unpleasant an 
YELLOW, while BUTTERFLIES are pretty, 


like BIRDS and flutter in the BLUE sky. 
Thus, another associative category in- 
trudes on the meaning of BUTTERLLY 
and makes it an associative concept not 
quite the same as any other. Contrary 
to zoology, associative BUTTERFLIES are 
as closely related to the nrmps as to the 
MOTHS, 

Large collections of words ought to 
enable us to discover the categories of 
association, if they exist, and how they 
change with age, if they do. In any 
event, an alternative to the contiguity- 
similarity tradition in association theory 
can come about only by a study of the 
associations themselves, and the pur- 
Pose of this paper is to present a 
somewhat different approach to the 
study of such associations. That ap- 
Proach consists in the assumption that 
the distribution of associates to any 
word provides the associative meaning 
of that word and the techniques neces- 
sary to apply this assumption to data. 
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CONTIGUITY AND REINFORCEMENT IN RELATION 
TO CS-UCS INTERVALS IN CLASSICAL 
AVERSIVE CONDITIONING? 


JOAN E. JONES? 
University of Sydney 


It is generally accepted that, in clas- 
sical aversive conditioning, the best re- 
sults are obtained if there is a temporal 
interval of approximately 450 milli- 
seconds between the presentation of the 
conditioned stimulus (CS) and the 
uncondtiioned stimulus (UCS). This 
observation creates a problem for the- 
orists who adopt the common sense 
notion of contiguity as a basis for the 
association of events in classical condi- 
tioning. Such theorists formulate the 
problem as one of explaining why 
separation of CS and UCS is superior 
to contiguity and of why an interval of 
450 milliseconds is better than others. 
Because the problem has been formu- 
lated in this way, the theoretical con- 
Structs proposed are rather arbitrary 
and have not been subjected to experi- 
mental tests. For example, Hull 
(1952) proposes a theoretical construct 
which bridges the temporal gap be- 
tween the stimulation of a receptor by 
the CS and the evocation of the un- 
conditioned response (UCR), namely 
the molar stimulus trace. The same 
function is performed by Guthrie’s 
(1935) concept of proprioceptive 
stimulation, though he does not explain 
why certain trace conditions are actu- 
ally superior to the simultaneous pres- 


1 Based on a thesis submitted in partial 
fulfilment of the requirements for the PhD 
degree, University of Sydney, 1959, super- 
vised by R. A. Champion. The author ex- 
presses appreciation to Ri. Bi Bromiley and 
M. Humphries for ei helpful criticisms 

of this article. 
ae Defence Research Medical Labo- 
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entation of CS and UCS. Cognitive 
theorists invoke the perceptual princi- 
ples governing grouping, particularly 
temporal proximity (e.g., Tolman, 
1949). Hilgard and Marquis (1940) 
interpret the classical situation in terms 
of reinforcement rather than of the 
contiguity of associated events, basing 
their interpretation on the observation 
that the most favorable interval ap- 
pears to be somewhat greater than the 
latency of the conditioned response 
(CR). 

Actually there is less agreement in 
the literature on optimum intervals 
than the usual textbook statement im- 
plies. Experimental evidence on the 
effect of the CS-UCS interval in con- 
ditioning human subjects is summar- 
ized in Table 1. In the experiments 
cited the classical defense method was 
used with the exception of Wolfle's 
where it appears that avoidance was 
possible. In considering Table 1 it 
should be noted that each of the stud- 
ies reported has observed the acquisi- 
tion of only one response. In com- 
paring any two studies it must be 
recognized that many experimental 
conditions are varied even where dif- 
ferent experimenters have employed 
the "same" response. The most im- 
portant of these is the experimenter’s 
original choice of intervals which may 
impose severe limitation on the opti- 
mum obtained. The range and the 
spacing of the chosen values have con- 
tributed to both artifactual similarities 
and dissimilarities in optima reported 
by different investigators. From Table 


— 
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TABLE 1 
EXPERIMENTAL STUDIES OF CS-UCS [INTERVAL EFFECTS 
Response Author Intervals sampled^ Optimum* 
Eyeblink McAllister (1953) 100, 250, 450, 700, 2500 250 
Bernstein (1934) —900, — 500, 100, 200, 250, | 300 
300, 500, 1000, 1480 
Hansche and Grant (1960) | 190, 390, 590, 790 390 (training and 
extinction) 
Kimble (1917) 100, 200, 225, 250, 300, 400 400 
Reynolds (1945) 250, 450, 1150, 2250 450 
Myers (1950) 500, 1000, 1500, 2000 500 
Finger with- | Wolfle (1932) —2000, —1000, —600, 300 (avoidance 
drawal — 200, 0, 200, 300, 400, 600, | possible) 
1000, 2000, 3000 
Wolfle (1930) — 500, — 250, 0, 250, 500, 500 (avoidance 
750, 1000, 1250, 1500 possible) 
Spooner and Kellogg (1947) | —500, — 250, 0, 500, 1000, 500 
1500 
Pupillary Geralland Woodward (1958)| 250, 500, 1500, 2500 1500 
dilation 
GSR Moeller (1954) 250, 450, 1000, 2500 450 . 
White and Schlosberg (1952)| 0, 250, 500, 1000, 2000, 4000 | 500 (extinction) 
Wickens, Gehman, and 500, 600, 700, 800, 920, 1040, | 500 & 1500 (two 
Sullivan (1959) 1550, 2500, 4600 optima) . 
Jones (1959) 240, 550, 740, 1060, 1575 550 (training) 
1060 (extinction) 
Jones (1959 unpublished) 500, 1000 1000 
Bierbaum (1959) — 3000, 0, 500, 3000, 5000 3000 


^in milliseconds 


l it will be seen that all the studies 
employing short latency responses 
(eyeblink and finger withdrawal) re- 
Port optima of 500 milliseconds, or 
less, and as short as 250 milliseconds, 
while the studies in which responses 
of longer latency have been used 
(pupillary dilation and GSR) yield 
Optima of 450 milliseconds or more, 
even as long as 3000 milliseconds. 
he difference is in the direction ex- 
pected by Hilgard and Marquis but 
the optima reported in five of the six 
studies are well short of the 
latency of that response. 
. This paper presents an interpreta- 
tion of the effect of the CS-UCS inter- 
3l in terms of the contiguity of the 
and the UCR and of reinforcement. 


This interpretation does not begin with 
the assumption of an invariant opti- 
mum interval but accepts the empiri- 
cally demonstrated fact that no single 
optimum exists. The propositions to 
be outlined differ from existing two- 
process theories of conditioning. 
Whereas Skinner (1935) and Schlos- 
berg (1937) invoke one principle 
(contiguity) in classical conditioning 
and another (reinforcement) in instru- 
mental conditioning, the present theory 
invokes the action of both principles 
in classical conditioning. It differs 
from Mowrer's (1947) two-factor the- 
ory of classical conditioning in that it 
contains no reference to intervening 
emotional reactions and no distinction 
is made between the principles govern- 
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ing the conditioning of responses on 
the basis of their neurophysiological 
characteristics. 


A Two-PniNciPLE THEORY OF 
CLASSICAL CONDITIONING 


When the CS-UCS interval is 
varied two temporal relationships are 
affected. One is the interval between 
the CS and the UCR, pertaining to 
the principle of S-R contiguity. The 
other is the interval between the CR. 
and the UCS, which defines the delay 
of reinforcement. "These changes are 
effected at the same time but may 
differ in extent and/or direction. 

The following assumptions are made 
about the operation of S-R contiguity 
and of reinforcement in classical 
conditioning : 

S-R Contiguity. The optimum con- 
dition for the establishment of a new 
S-R connection is that the CS and 
the UCR be contiguous in time. The 
greater the degree of temporal separa- 
tion of these events (in either direction) 
the weaker the resulting connection. 

Reinforcement. The optimum con- 
dition for the strengthening of S-R 
connections is that the CR coincides 
with a reinforcing state of affairs. 
The greater the degree of temporal 
displacement of the reinforcing event 
and the CR (in either direction) the 
less effective the reinforcement. 

The first statement modifies the 
strict contiguity principle maintained, 
for example, by Hull and by Guthrie 
by suggesting that a gradient operates 
both forward and backward in time. 
It is reasonable to suppose that the 
difference between connections based 
on strict contiguity and those based on 

lesser or greater degrees of discontigu- 
ity might differ quantitatively rather 
than qualitatively. The gradient can 
be used in this connection though the 
mechanism underlying it may not be 
specifiable at present. The question of 
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whether the gradient of reinforcement 
is unidirectional or bidirectional is a 
controversial one. The problem is 
complicated by conflicting experi- 
mental evidence and by a confusion of 
empirical and theoretical statements. 
Neither of the statements made above 
is meant to be an empirical generali- 
zation; as assumptions they may be 
justified by their usefulness in 
prediction. 

Historically, the delay of reinforce- 
ment principle belongs to instrumental 
learning where in some situations the 
experimenter can vary the period of 
delay by withholding the reward for 
any predetermined period of time after 
the subject makes the required re- 
sponse. He cannot do so in classical 
conditioning where, by definition, the 
UCS is presented in a fixed temporal 
relationship to the CS regardless of 
whether or when, the CR is evoked. 
The fact that the delay is not under 
direct experimental control does not, 
however, exclude the possibility that 
the delay of reinforcement principle 
applies in classical conditioning. Hil- 
gard and Marquis (1940) recognize 
this and interpret classical condition- 
ing in reinforcement terms but require 
that the CR be a preparatory reaction 
in which case the effect must be 
unidirectional. 

The present interpretation is less 
specific than that of Hilgard and Mar- 
quis. It would seem best at present 
to follow Spence in defining a reinforc- 
ing state of affairs empirically as be- 
longing to that class of “environmental 
events exhibiting the property of in- 
creasing the probability of occurrence 
of responses they accompany" (Spence, 
1956, p. 32). If the reinforcing event 
is defined in this way, the CR does 
not have to be the direct means of 
bringing it about for strengthening to 
occur but may be facilitated if a re- 
inforcing event occurs in that temporal 
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neighborhood. Just what event in 
classical conditioning comprises a re- 
inforcing state of affairs is a matter 
of discovery. However, because the 
UCS is the only event which is sys- 
tematically associated with the S-R 
Sequence it is reasonable to suppose 
that some feature of the UCS presenta- 
tion is the reinforcer, most likely its 
Onset or its offset. In conditioning 
fear reactions data presented by Mow- 
rer and Solomon (1954) indicate that 
UCS onset is the critical event. 
Where the UCS is of constant (and 
short) duration either assumption will 
lead to essentially the same prediction 
and in the following discussion UCS 
Onset is treated as the reinforcing 
event. 

It is proposed to adopt tentatively 
the assumption of Hull (1952), Guth- 
rie (1935), and of Hilgard and Mar- 
quis (1940) that the occurrence of a 
Tesponse at an effector is the event 
Whose temporal relations to other 
events is crucial in establishing learned 
Connections. While it is recognized 
that the latency of the UCR often dif- 
fers from the latency of the CR and 
that both vary from trial to trial, for 
Purposes of the present discussion the 
two will be treated as if equal and 
Constant. 

Consider the application of the the- 
Ory to the limiting case, i.e., where the 
Tesponse latency is zero. If the CS- 

CS interval is zero, the UCR starts 
at the onset of the CS and the CR 
Starts with UCS onset. Therefore, 
Simultaneous presentation of CS and 

S is most satisfactory as then both 
the gradient of contiguity and the 
Sradient of reinforcement are at maxi- 
mum strength. ]f the interstimulus 
Interval is lengthened in either the for- 
Ward or backward direction then both 
the relevant intervals are lengthened 

conditioning is poorer. 

n practice the response latency (x) 


is always greater than zero. In this 
case the CS-UCS interval which most 
favors contiguity between CS and 
UCR is — x (that is, backward pres- 
entation where UCS — CS =x). How- 
ever, if that is so, the CR is sepa- 
rated from UCS onset, the reinforcing 
state of affairs, by an interval of 2x, 
so that reinforcement is relatively 
weak. Strongest reinforcement will 
occur where the CS-UCS interval 
equals + r, but then the CS and UCR 
are separated by an interval of 2x dura- 
tion. So, if the interstimulus interval 
is such as to favor the effect of either 
the gradient of contiguity or of re- 
inforcement, it necessarily follows that 
the other is operating at less than 
maximum efficiency. The longer the 
latency of the response the greater the 
relative disadvantage to the nonfavored 
mechanism. 

In predicting the optimum interval 
the combination of overlapping gradi- 
ents must therefore be considered. 
The predictions made will differ ac- 
cording to whether the manner of their 
combination is assumed to be additive 
or multiplicative. The multiplicative 
relationship assumes that both the 
gradients must be at greater than zero 
strength for conditioning to occur. 
That is, neither alone is sufficient for 
the establishing and/or strengthening 
of conditioned responses; both must 
operate in conjunction throughout the 
learning process. But the reinforce- 
ment principle is concerned with tem- 
poral relationships between the CR and 
the UCS. This gradient cannot, by 
definition, have an effect on an S-R 
tendency until the strength of the tend- 
ency is such as to bring the CR above 
threshold. That is, the effects of re- 
inforcement are not felt until condi- 
tioning is already somewhat underway. 
How then, does the CS come to elicit 
the CR in the first place? The initial 
connection would appear to depend 
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on the operation of the contiguity prin- 
ciple alone. Unless the arrangement 
of stimuli is such that the CS and 
UCR fall within the effective range of 
this gradient the CR will never be es- 
tablished. As the strength of the new 
connection reaches threshold value re- 
inforcement begins to play a role. As 
the connection grows stronger and the 
response becomes more consistent the 
importance of reinforcement becomes 
correspondingly greater, until eventu- 
ally the effect of contiguity is negligible 
by comparison. It is more meaningful, 
therefore, to adopt the assumption that 
the gradients combine in an additive 
fashion weighting the contribution of 
each in accordance with its changing 
importance. 

The exact character of the predicted 
outcome depends on assumptions which 
are made concerning the shape, relative 
extent, and relative heights of the 
gradients. In the interests of simplic- 
ity and in the absence of direct evi- 
dence to the contrary it is assumed that 
both the contiguity and reinforcement 
gradients are linear. Though their 
actual extent is not specified it is en- 
visaged that it would be measurable 
in terms of seconds or even milliseconds, 
For the present it is assumed, again 
for simplicity and in the absence of 
direct evidence, that both gradients are 
symmetrical and that they are equal 
in extent. The change in the relative 
importance of the two mechanisms is 
represented by changing the relative 
heights of the gradients according to 
the stage of training, while their ex- 
tent remains constant throughout. 
Summarizing, the gradients are Jinear, 
symmetrical, and equal and limited in 
extent. The reinforcement gradient is 
at maximum height on the first trial 
(zero where there is no tendency for 
the CS to elicit the response to be 

conditioned before training begins) 
and rises as a function of the number 


Joan E. Jones 


of reinforced trials. This rise is ac- 
companied by a fall in the height of 
the contiguity gradient indicating its 
relative decline in importance once the 
CR is established. The gradients com- 
bine in an additive fashion to affect the 
strength of the learned S-R connec- 
tion, which is reflected in performance, 
other things being equal. 

As the absolute extent of the gradi- 
ents is not at present specifiable the 
amount of overlap for a response with 
a given latency is not known. However, 
relative statements can be made about 
the overlap for responses of short, me- 
dium, and long latencies. The over- 
lapping gradients of contiguity and 
reinforcement are shown on the left 
of Figure 1 for responses of various 
latencies at various stages of training. 
The latency of the response is shown 
in the schematic representation by the 
amount of separation at the maxima 
of the gradients. Thus, Part A of 
Figure 1 represents the hypothetical 
case where latency equals zero. The 
upper dotted line represents the gradi- 
ent of contiguity early and the gradient 
of reinforcement late in training. Con- 
versely, the lower dotted line indicates 
the gradient of reinforcement early 
and that of contiguity late in training. 
The solid line represents two gradients 
when equal in height at an intermediate 
Stage of training. Parts B, C, and D 
show the situation where the latencies 
are Short, medium, and long respec- 
tively. If latency equals x then the 
distance between the maxima equals 
2x. Parts BI, BII, and BIII of Fig- 
ure 1 show the change in relative con- 
tributions of the gradients at three 
stages of training, early, middle, and 
late, respectively. Thus, in Part BI, 
early in training, the height of the 
contiguity gradient exceeds that of the 
reinforcement gradient, in keeping 
with the argument that reinforcement 
is relatively ineffective at this stage. 
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In Part BIT the two gradients are 
€qual in height, while in Part BII 
the gradient of reinforcement is higher 
than the gradient of contiguity. Parts 
CI, CII, and CIII and DI, DII, and 
" represent the same changes in 
Importance of the gradients for re- 
Sponses with longer latencies. 
, The corresponding performance func- 
tions relating response strength to 
-S-UCS interval are shown on the 
"ight of Figure 1, each of these func- 
ina being obtained by additive com- 
Nation of the gradients on the left? 


i H 
pe "ihe value zero on the abscissae of the 
Ormance functions does not denote the 


Consideration of these performance 
functions leads to the following deduc- 
tions from the theory: 

1. With responses of latency greater 
than zero, the optimum interval for 
conditioning shifts with the amount of 
training (Parts BI, BII, and BIII). 
The direction of this shift is from a 
relatively short interstimulus interval 
early in training to a relatively long 
one later. The amount of shift is 
greater the longer the latency of the 
response being conditioned. (This 
shift in Parts BI-III may be compared 
with that in Parts CI-III and DI-IIT.) 

2. With responses of such a latency 
that the overlap of the gradients is 
slight (Part D) a second peak appears 
in the performance function if the CS 
tends to elicit the response to be con- 
ditioned from the outset, as is the case 
where a weak response tendency is being 
strengthened rather than a new con- 
nection being formed. The longer the 
response latency, the greater is the 
separation of the peaks. Two experi- 
ments which fulfill these necessary 
conditions reveal the double-peaked ef- 
fect. Wickens, Gehman, and Sullivan 
(1959) report 500 milliseconds best 
with a second peak at 1500 milliseconds 
and Jones (1961) obtained peaks at 
440 and 1045 milliseconds, The con- 
ditions for producing the second peak 
are fairly common, in particular with 
“emergency” responses such as the 
GSR which was the response used in 


simultaneous presentation of CS and UCS 
by the experimenter. The gradient of con- 
tiguity operates about a neural event conse- 
quent upon the occurrence of the CS rather 
than about the CS itself. In order to trans- 
form the functions shown here into forward 
and backward experimental arrangements of 
stimuli the zero point should be shifted to 
the left by an amount corresponding to the 
temporal lag between the CS and its neural 
counterpart. This distinction has not been 
made in labeling the figure as the predictions 
involve intervals of relative rather than of 


absolute lengths. 
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both the experiments cited. With 
many other responses, however, no 
such overt tendency is present before 
training and in some cases subjects 
who respond to the CS on a pretrain- 
ing test are discarded. It might be 
argued that a covert tendency to re- 
spond is always present and all that 
is ever observed is the strengthening 
of existing response tendencies, rather 
than the establishing of new ones. 
However, it is assumed here that for 
strengthening by reinforcement to 
occur the response tendency must first 
rise above threshold so that an overt 
response, though perhaps a weak one, 
occurs in the temporal neighborhood 
of a reinforcing state of affairs, The 
mechanism which acts to bring the 
latent tendency above threshold is con- 
tiguity of the CS and the required 
overt response, this contingency being 
arranged experimentally by the pres- 
entation of the UCS. Where the CS 
and UCR are too widely separated the 
response tendency cannot be brought 
to the necessary strength in the experi- 
mental session. Hence, only when the 
tendency has already reached the requi- 
site level, can the range of intervals 
spanned exclusively by the reinforce- 
ment gradient be effective in further 
strengthening the tendency in question. 
Hilgard and Marquis (1940) fail to 
indicate what mechanism acts to bring 
an initially subthreshold response 
tendency to sufficient strength for 
the operation of their reinforcement 
principle. 

3. If measures of response strength 
are summed over all stages of training, 
the longer the response latency the 
wider the range of intervals which are 
effective in producing conditioned re- 
sponses if the CS from the outset tends 
to elicit the response to be conditioned 
(compare the range of CS-UCS values 
embraced by the performance functions 
in Part B, C, and D). In addition, 
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the performance function is less 
sharply peaked with longer latency re- 
sponses. This prediction was tested 
by Jones (1961) in an experiment 
where two responses which were 
evoked by the same CS but differed 
in latency were conditioned in the same 
subject by the classical method. It 
was found that the function for the 
conditioned GSR was flatter and 
broader than that for finger movement. 

4. Where the response latency is so 
long that the gradients do not overlap, 
the range of intervals covered by the 
contiguity gradient will give increased 
response strength corresponding to the 
height of that gradient early in training 
but the strength of the tendency will 
not continue to increase and may de- 
Cline due to the lack of reinforcement. 
No other intervals will yield any con- 
ditioned responses unless the CS elicits 
the response to some extent from the 
outset. This case is not shown in Fig- 
ure 1 but is an extension of Prediction 
2 above. 

The above predictions differ sharply 
Írom any to be derived from other 
approaches to the problem of the in- 
terval effect. Apart from the analysis 
by Hilgard and Marquis there has 
been no explicit attempt to relate the 
effect to response latency. It is possi- 
ble, however, to deduce the appropriate 
interactions. Cognitive theory, which 
relates the efficacy of learning to the 
perceptual grouping of two stimulus 
events or the internal representations 
of such events, implies that characteris- 
tics of the responses evoked are irrele- 
vant. Therefore, although it is difficult 
for a cognitive theorist to explain the 
superiority of a particular interval, he 
certainly expects to find the same in- 
terval superior whatever the latency 
of the CR or UCR. Hull is concerned 
with the contiguity of the stimulus 
trace and an effector activity as is clear 
from his statement of the primary re- 
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inforcement postulate (Hull, 1952, p. 
5, Postulate III). Therefore, if the 
trace is at its maximum strength aíter 
450 milliseconds, then: (a) an inter- 
stimulus interval of 450 milliseconds 
is superior only when the UCR has a 
latency approaching zero; (b) the 
longer the latency of the UCR the 
shorter should be the optimum CS- 
UCS interval; (c) where the response 
latency is longer than 450 milliseconds 
backward presentation of CS and UCS 
is most favorable; (d) optima longer 
than 450 milliseconds cannot occur re- 
gardless of the characteristics of the 
response. Guthrie also implies that a 
response rather than a neural event 
must be contiguous with the cue 
(which is almost certain to be a medi- 
ating stimulus event). It follows from 
Guthrie's analysis, as it does from 
Hull’s, that the longer the latency of 
the UCR the shorter the optimum 
interval if the UCR is to be contiguous 
with a consistently occurring "true" 
CS. None of these positions are sup- 
ported by the experimental evidence 
at present available. 

The deductions from the two- 
principle theory may be formulated in 
terms of a comparison of trends in 
the learning curves for different inter- 
stimulus intervals. With response 
latency held constant, it is predicted 
that short-interval groups show a com- 
paratively rapid initial improvement 
and the acquisition curves then flatten 
out and may even decline (see the 
rise and fall of the left wing of the 
performance function in Figure 1, 
Parts CI-III). Longer-interval groups 
(right-wing of the performance func- 
tion) show a less rapid rise initially 
but eventually reach, and may even 
Surpass, the shorter groups. The rela- 
tionship between the curves is shown 
in Figure 2. The other theories men- 
tioned above do not anticipate a 
Change in pattern among the curves 
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` 
CS-UCS 'ZERO' 
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TRAINING 
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Fic. 2. Relationships among performance 
curves in the acquisition of a CR with vari- 
ous CS-UCS intervals. 


during acquisition but can predict only 
that the pattern established early be- 
comes more distinct as training pro- 
gresses. In a direct test of the pre- 
diction, Jones (1961) found the ex- 
pected pattern among the acquisition 
curves for groups in which the GSR 
was conditioned at various intervals 
from 20 to 1245 milliseconds. 

Further evidence supporting the 
prediction is found in the learning 
curves obtained by Moeller (1954) for 
the GSR and McAllister (1953) for 
the eyeblink which show that longer- 
interval groups approach their perform- 
ance asymptote at an initially slower 
rate than shorter-interval groups. 
Jones (1959) observed that groups in 
which the GSR was conditioned with 
550 and 1060 milliseconds intervals 
respectively approached approximately 
the same asymptote but that the 550 
milliseconds group did so much more 
rapidly. It may be noted that White 
and Schlosberg (1952), who report a 
short optimum of 500 milliseconds for 
the GSR using measures taken on ex- 
tinction trials, trained their subjects 
for but five conditioning trials. 

It is often reported of simultaneous 
and backward presentation of the CS 
and UCS that the performance curves 
show a brief initial rise followed by 
a gradual decline (Spooner & Kellogg, 
1947). This is the type of effect that 
would be expected under these condi- 
tions from the analysis given here. 
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The explanation commonly offered in- 
vokes a new mechanism sometimes 
referred to as “sensitization” but no 
additional concept is required to incor- 
porate the effect here. Backward con- 
ditioning emerges as a special case of 
the interval effect rather than differing 
qualitatively from forward condition- 
ing. Hull (1943, p. 172) suggested a 
treatment of this kind in his discussion 
of Wolfle’s data but his later develop- 
ment of the stimulus trace concept 
made such interpretation difficult 
where the UCR as well as the UCS 
preceded the CS. Confirming the pos- 
sibility of real backward conditioning, 
Champion and Jones (1961) found 
that a group in which the GSR was 
conditioned with a backward interval 
of 750 milliseconds performed better 
than did a pseudoconditioning control 
group. 

Another prediction which follows 
from the present interpretation con- 
cerns the effect of a shift of inter- 
stimulus interval during the course of 
training. Suppose that for a group of 
subjects forward conditioning begins 
with a short interval in the range most 
favorable for the effect of contiguity 
until the CR begins to appear fairly 
consistently. If at this stage the inter- 
val is changed to a longer one, more 
favorable for the effect of reinforce- 
ment, performance will surpass that of 
control groups conditioned throughout 
at either the shorter or the longer in- 
terval. Further, long latency responses 
are able to tolerate larger shifts of this 
kind than short latency responses, 

If the CS-UCS interval is held con- 
stant and the learning curves for re- 
sponses of different latencies are com- 
pared they will differ in shape in a 
manner depending on the range of val- 
ues in which the interval is held. At 

an intermediate range of interval val- 
ues the curves for different responses 
have the same shape because contiguity 


and reinforcement are favored equally 
regardless of response latency. How- 
ever, with intervals in the shorter 
range (e.g., as in backward condition- 
ing) separation of CR and UCS onset 
(reinforcement) is much greater than 
the separation of CS and UCR (con- 
tiguity) and the difference between 
the two is emphasized with a long 
latency response. As it has been pro- 
posed that contiguity is of greatest im- 
portance early in training but that re- 
inforcement gains in importance dur- 
ing the later stages, the longer latency 
response shows the more rapid initial 
rise, but later in training the shorter 
latency Tesponse reaches and maintains 
a higher level of performance. Con- 
versely, if the interval is lengthened 
beyond the intermediate range (as in 
forward conditioning) then contiguity 
is more adversely affected than is re- 
inforcement, This difference is again 
emphasized where the response has a 
long latency. Therefore, in this case 
the performance curve for the short 
latency response shows a more rapid 
initial rise but is later surpassed by 
that for the long latency response. 
The predicted relationship was ob- 
tained by Jones (1961) in condition- 
ing finger withdrawal and GSR with 
forward intervals, 

The two-principle theory of classical 
conditioning put forward in this paper 
leads to the set of predictions outlined 
above, some of which have already 
been confirmed. These predictions 
cannot be derived from other theories 
in their present form. No theorist to 
date has discussed the interval effect 
in relation to the course of learning 
and only Hilgard and Marquis have 
attempted to relate the optimum CS- 
UCS interval to the latency of the 
response being conditioned. When 
these factors, i.e., amount of training 
and response latency, are considered 
in terms of the constructs offered by 
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others in explanation of the interval 
effect the predictions of the inter- 
actions concerned differ markedly from 
those of the present interpretation. 


SUMMARY 


The function relating the interval 
between the onset of the CS and the 
UCS to response strength in classical 
aversive conditioning is interpreted in 
terms of the joint action of two princi- 
ples. Early in training the efficacy 
of conditioning depends mainly on the 
contiguity of CS and UCR but later 
mainly on the proximity of the CR to 
reinforcement (UCS). In both cases 
a bidirectional gradient is assumed to 
Operate such that the greater the dis- 
Contiguity of the events the less effec- 
tive the learning. It is predicted that 
variations in experimental procedure, 
Such as the latency of the response 
being conditioned, which affect either 
one or both of these relationships affect 
the function relating the CS-UCS in- 
terval to response strength and that 
the shape of the learning curves de- 
pends on the CS-UCS interval used 
n training. 
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This paper introduces a new deter- 
ministic model for analyzing decision 
making in gambling, the pattern and 
level of risk (PLR) model. At the 
beginning of the paper, the experi- 
mental evidence for four traditional 
models is examined and the inade- 
quacies of these models shown. Then 
the PLR model is described, predic- 
tions are drawn from the model, and 
evidence is presented to support the 
predictions. Finally some implications 
of the PLR model are shown. 


Tue EVIDENCE ror TRADITIONAL 


Mopbets 
Edwards (1955) described four, 
related deterministic models which 


have, in large part, guided experi- 
mentation on gambling decisions. All 
four assume that a gambler will choose 
the alternative (e.g., a bet) with the 
maximum value, and that the value of 
an alternative can be calculated by 
multiplying the value of each of its 
Outcomes by the probability that the 
outcome will occur and summing these 
products over all outcomes. The 
models differ with respect to whether 
Objective or subjective values and 
probabilities are used in these calcula- 
tions. The four models are defined 
by the following equations for calculat- 
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ing the value of an alternative: 


EV = = pid: [1] 
SEM = 2 Pi*$; [2 

EU = È pits [3] 
SEU = Z Pytu; [4] 


i 


Equation 1 defines expected value 
(EV), where p; and $; stand for the 
objective probability and dollar value 
of the ith outcome and X/i means that 
the products f;$; are summed over all 
of the potential outcomes of the alter- 
native. Equation 2 defines subjectively 
expected money (SEM), where Pj* 
refers to the subjective probability of 
the ith outcome and the other symbols 
have the same meanings as above. 
Equation 3 defines expected utility 
(EU) where u; refers to the utility 
(subjective value) of the ith outcome 
and the other symbols have the same 
meanings as above. Equation 4 de- 
fines subjectively expected utility 
(SEU); all of the symbols have the 
same meanings as in the other models. 

In evaluating these models, three 
criteria will be kept in mind: (a) the 
range of alternatives or choice situa- 
tions within which each model has 
predictive value; (b) the accuracy 
with which each model predicts de- 
cisions within its range; and (c) in 
ranges where both have predictive 
value, whether a model containing a 
subjective parameter is any better than 
the comparable one containing an ob- 
jective parameter. The last considera- 
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tion is important from the point of 
view of parsimony. If a model with 
an objective parameter is as good at 
predicting decisions within a certain 
range as the comparable model with a 
subjective parameter, then the former 
should be preferred because it requires 
no special measurement procedures, 


The EV Model 


The EV model (and the SEM and 
EU models as well) is, of course, in- 
applicable in situations where the ob- 
jective parameters which compose it 
are not defined. In addition, it is un- 
likely to be valuable where objective 
parameters, though defined, are not in 
some way communicated to the gam- 
bler or where individual tastes are 
obviously important in determining the 
value of outcomes (as in preference 
among foods). There remains a hard 
core of situations in which objective 
probabilities are fairly apparent and 
outcomes are money or objects easily 
convertible into money. Here the EI” 
model is at least worthy of considera- 
tion. Even when these conditions are 
met, the EV model is known to be 
grossly inadequate in the range of bets 
which have very small probabilities of 
winning or losing large amounts of 
money (Allais, 1953; Bernoulli, 1954). 
The inaccuracy of the EV model in 
these cases is shown by the willingness 
of people who are well informed about 
the probabilities to buy lottery tickets 
and insurance (where EV is consider- 
ably less than zero) and to play the 
St. Petersburg Paradox only at very 
low stakes (where EV is infinite at 
any stake). . 

In the range of bets which have 
moderate probabilities and values of 
outcomes, research findings seem to 
support the following generalization : 
The EV model is a poor predictor of 
decisions where EV differences are 
small, but improves in predictive ca- 
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pacity as EV differences increase. 
Thus, the model is useless for predict- 
ing choices between bets with the 
same EV ; the model predicts indiffer- 
ence between such bets whereas many 
experiments have revealed consistent 
preference (e.g., Coombs & Komorita, 
1958; Edwards, 1953). Mosteller 
and Nogee (1951) found that predic- 
tion improved with an increase in EV 
difference as follows: Where the dif- 
ference in EV between two two-out- 
come bets lay between 2¢ and 504, 
only 4296 of the predictions were cor- 
rect (50% would be expected by 
chance) ; between 55¢ and $1.55, 57% 
were correct ; between $1.80 and $2.20, 
60% were correct; and between $2.50 
and an undisclosed maximum, 6746 
were correct. These percentages would 
undoubtedly have improved somewhat 
if the bets had been replicated and 
“dominant” Preferences established ; 
but they are low enough to suggest 
that the EV model leaves much to be 
desired as a predictive device, 

One other generalization about the 
Scope of the EV model is suggested by 
an experiment by Edwards (1954a) : 
The less complex the bets between 
which a decision is made, the more 
accurate will the EV model be in pre- 
dicting these decisions. 


The SEM Model 


There is one kind of bet, very com- 
mon in everyday gambling situations 
Such as roulette or horseracing, in 
Which the SEM model makes grossly 
Incorrect predictions. This is the bet 
in which a person may stake as much 
money as he likes on a chance event 
and wins a fixed multiple of his stake 
if the chance event occurs. In such 
cases, the SEM model (and the EY 
model) predicts that if a man will 
bet at all on a chance event, he 
will put all the money he has on 
that chance event, a prediction which 
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is incompatible with everyday observa- 
tions of gamblers. However, in the 
typical laboratory situation, where 
only two bets are available at a time, 
the model does not lead to absurd 
predictions. 

Laboratory data appear to support 
two generalizations about the SEM 
model: (a) The SEM model is fairly 
accurate in predicting choice between 
simple bets which differ in probability 
of winning or losing but whose out- 
comes involve the same general level of 
money. (b) The SEM model is only 
slightly if any better than the EV 
model in its ability to predict choice 
between bets with the same probabil- 
ities of winning and losing but differ- 
ing in level of money. 

Evidence for the first generalization 
comes from the work of Edwards on 
probability preferences (1953, 1954a, 
1954b, 1954c). When asked to choose 
between bets with equal EV but dif- 
fering in probability of winning, the 
Subjects showed a marked group pref- 
erence for certain probabilities (e.g., 
4/8) and a marked dislike for others 
(e.g., 6/8). The same preference pat- 
terns were found when the amount of 
money involved in the bets was in- 
creased, making an interpretation in 
terms of the EU model very unlikely. 
Similar results were found for indi- 
vidual cases by Coombs and Pruitt 
(1960), who pointed out in addition 
that people differ in the probabilities 
they prefer. The SEM model would 
appear to be the simplest way of 
explaining these preferences, and a 
test of this explanation was made by 
Edwards (1955). First he used a 
betting method which assumed the 
SEM model to measure for each sub- 
Ject the subjective probability of win- 
hing and losing at eight levels of prob- 
ability, Then he inserted these values 
into the SEM model to predict prob- 
ability preferences. Eighty-three per- 


cent of the choices between winning 
bets and 65% of the choices between 
losing bets were successfully predicted. 
Other evidence suggests that predic- 
tions would have been even better if 
the choices had been replicated enough 
times to establish the dominant prob- 
ability preference pattern for each sub- 
ject. It should be noted that the EV 
model is clearly inadequate to explain 
these data since all of the bets had 
the same EF level so that there was 
no basis for making predictions. 
Evidence for the second generaliza- 
tion comes from an experiment by 
Suppes and Walsh (1959) in which 
predictions by the SEM model to 
choices between two-outcome bets with 
a subjective probability of winning of 
4 were correct only 57% of the time. 


The EU Model 


The measurement of utility has 
turned out to be a very difficult and 
elusive task. At least seven methods 
have been used over the past 10 
years, three assuming the EU model 
and four the SEU model. In each 
case the utility values thus measured 
have been put back into the model 
to predict choices between pairs of 
bets. Thus, to a greater or lesser 
extent, these studies constitute a test 
of the EU and SEU models. 

Two of the three EU experiments 
have not been adequate tests of the 
model. One, by Edwards (1955) 
used a method of measuring utility 
which he has since repudiated (1961) ; 
the results were inconclusive. The 
other by Coombs and Komorita (1958) 
yielded 29 out of 30 correct predic- 
tions, but all 30 would have been cor- 
rect on the assumption that the sub- 
jects preferred less risk to more, a 
characteristic they showed throughout 
their behavior. Furthermore only 
three subjects were used. 

The third study (Mosteller & 
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Nogee, 1951) is a better test of the 
model but there are ambiguities in 
interpreting it. (Some of the results 
of this study have already been pre- 
sented in the section on the EV 
model.) Utilities of money were gen- 
erated and used to predict choices be- 
tween bets at a number of different 
levels of probability and money. These 
predictions were correct in 66% of 
the pairs, as opposed to 50% success 
with the EV model (expectation by 
chance was also 5096). As in the 
case of predictions from the EV model, 
the accuracy of prediction increased 
with an increase in EU difference 
between the bets: in pairs where the 
difference was over $2.50, 93% of the 
predictions were correct. One might 
conclude from this study that: The 
EU model is somewhat better than 
the EV model when the EU difference 
between bets is small and improves 
markedly as the EU difference in- 
creases. However, the method used 
for measuring utility is ambiguous; 
the decisions made by the subjects in 
the utility measuring sessions could 
just as well have been determined by 
subjective distortions of probability as 
by departure of the utilities from 
money values. If this had been true, 
the SEM model would have made as 
good or better predictions than the 
EU model; this possibility was not 
examined, although data were avail- 
able to do so. Conclusions about the 
value of the EU model must await this 
further check. 


The SEU Model 


Aside from Edwards' (1955) ex. 
periment mentioned in the previous 
section, which employed a question- 
able method for measuring utility and 
will therefore not be further discussed, 
all the experiments on the SEU model 
have used a chance event with both 
objective and subjective probability 
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equal to $. This means “two things: 
(a) that the model remains untested 
over a broad range of bets and (b) 
that the EU and SEU models cannot 
be compared on the basis of existing 
data. 

In the case of bets involving money, 
the SEU model does not appear to 
have been particularly successful. One 
study (Davidson, Suppes, & Siegel, 
1957, pp. 19-81) claims Success, but 
the test of the SEU model was very 
weak permitting quite variable results 
to be considered positive evidence; the 
results must be considered inconclu- 
sive. The other experiment, done by 
Suppes and Walsh (1959), was a 
much stronger test of the model. The 
SEU model predicted correctly only 
3876 of the choices between paired 
comparisons (chance expectation : 50%) 
and was only slightly better than the 
SEM model which predicted correctly 
57% of the choices. 

The SEU model made a somewhat 
better showing in one study which 
used bets composed of outcomes other 
than money. Davidson, Suppes, and 
Siegel (1957, pp. 82-103) measured 
the utility of phonograph records with 
a linear programming method. De- 
pending on certain minor assumptions, 
predictions about choices between bets 
containing these phonograph records 
were correct 67% or 71% of the time. 
Another study (Hurst & Siegel, 1956) 
used cigarettes as the outcome, but the 


number of correct predictions was not 
reported. 


Summary Evaluation 
tional Models » 


"T 

All four of the traditional models 
have shown some success in predicting 
choices between bets, but most of the 
Successes have been of moderate mag- 
nitude (57% to 71% correct). There 
are two possible exceptions to this 
generalization, each involving restricted 
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ranges of bets: (a) When positive ex- 
pected value bets differ in probability 
of winning but have roughly the same 
level of money in their outcome, the 
SEM model is quite successful (83% 
correct). (b) When EU differences 
are very large, the EU model seems 
to do a very good job (93% correct) 
but there are some ambiguities in in- 
terpreting these findings. Aside from 
these limited findings, the four models 
have shown only moderate success. 


Wuere Do We Go Now? 


There are two directions which the 
study of gambling behavior might take 
from here: (a) further refinement of 
the traditional models, eg. new at- 
tempts to measure utility and subjec- 
tive probability, more comprehensive 
Studies of the range in which these 
models have predictive value, or (b) 
Search for new models. The first di- 
rection has much to be said for it; 
there are many gaps in our knowl- 
edge; most of the generalizations stated 
above could stand re-evaluation with 
new data. But there are other consid- 
erations which strongly suggest the ap- 
propriateness of the second direction: 
In an infant science, such as the study 
of decision making, most approaches 
are likely to be wrong so that the 
need for innovation is likely to be 
greater than the need for model re- 
finement. Furthermore the results 
discussed above are not really en- 
couraging—a great deal of the vari- 
ance is still not explained—and there 
is no reason to believe that the tradi- 
tional models will be able to account 
for it. In view of these arguments, 
the author has opted for a search for 
new models. 

This does not mean that we should 
Utterly reject the traditional models; 
he this point they are the best we 

ave, Because they are the best we 
ave, they can and must be used in 
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building models of other situations 
which resemble gambling (for exam- 
ples see Siegel, unpublished ?; Snyder, 
1960; Tanner & Swets, 1954). But 
some effort must also be made to im- 
prove our understanding of gambling. 

The search for new directions, which 
has culminated in the PLR model of 
this paper, was begun by Coombs and 
the author (1960) in a study of the 
utility of risk. In this study, utility 
of risk was conceptualized in terms of 
“variance preference’: the greater a 
person's utility of risk the higher 
variance he will prefer. A partial 
model of variance preference was de- 
vised: that the order of preference 
among variances for any given prob- 
ability of winning will be a folded J 
scale of variance. And this partial 
model was overwhelmingly supported 
by the data. This and some other 
regularities of behavior found in that 
study plus some observations of real 
life gambling have led inductively to 
a more comprehensive partial model, 
the PLR model. 


THE PATTERN AND LEVEL OF 
Risk Moprr 


The model which is developed in 
this paper applies only to bets which 
have a least one negative outcome, i.e., 
which have some element of risk. In 
the discussion section, some specula- 
tions will be presented about the pos- 
sibility of generalizing it to all bets. 


Concepts 


Every bet which has at least one 
negative outcome can be completely 
described by specifying its “pattern 
of risk” and “level of risk.” The pat- 
tern of risk (“pattern” for short) of a 
bet is determined by the number of 
possible outcomes, the probability of 

2 Siegel, S. Decision making and learning 


under varying conditions of reinforcement. 
Unpublished manuscript, 1960. 
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achieving each outcome and the ratio 
of the outcomes to one another. Two 
bets have the same pattern if they have 
the same number of outcomes at the 
same probabilities and if the outcomes 
of one can be obtained by multiplying 
the corresponding outcomes of the 
other by a constant. Thus two bets 
with the same pattern are: 


i chance to win 50¢ 

A: 4 chance to lose 50¢ 
. $ chance to win $1 
B: j chance to lose $1 


Two others are: 


1/2 chance to win $1 
C: 1/3 chance to lose 75¢ 
1/6 chance to lose 60¢ 


1/2 chance to win $3 
D: 1/3 chance to lose $2.25 
1/6 chance to lose $1.80 


The level of risk of a bet is defined as 
the sum of its negative outcomes 
weighted by their respective proba- 
bilities of occurrence. Thus the level 
of risk of Bet A is 25¢; of Bet B, 50¢; 
of Bet C, 35¢; and of Bet D, $1.05. 

The concepts of pattern of risk and 
level of risk may seem arbitrary at 
first: A bet can be described in many 
terms; why choose these? It is the 
contention of the author that people 
usually perceive bets in these terms, 
The evidence for this lies in the struc- 
ture of most popular gambling situa- 
tions, where characteristically there is 
a sharp distinction between the chance 
events and payoff ratios on the one 
hand (pattern of risk) and the size 
of stake on the other (level of risk). 
For instance, in roulette each of the 
squares on which stakes may be placed 
corresponds to a specific chance event 
and has a fixed ratio of payoff to 
stake; thus each of the squares cor- 
responds to a specific pattern of risk. 
The size of the gamblers stake, which 
determines the level of risk, is com- 


pletely distinct from the square on 
which he puts it. In betting on horse 
races, the gambler chooses a pattern 
by deciding which horse to bet on and 
a level of risk by deciding what size 
ticket to buy. In friendly wagers, 
separate announcements are often 
heard of the pattern of the wager (e.g., 
"I'll give you 5 to 3 on Navy”) and 
the amount bet (“O.K., I'll put $3 
on Army."). Finally, in investment a 
sharp distinction is drawn between the 
company or bond invested in (pat- 
tern) and the number of shares or 
bonds bought (level of risk). 

In building the PLR model, it is 
assumed that people have separate at- 
titudes toward pattern and level of 
risk and that these interact to deter- 
mine gambling decisions. Attitude to- 
ward pattern is embodied in the con- 
cept "utility of pattern," each pattern 
being assumed to have its place on a 
ratio scale of utility. In cases where 
utility of pattern is measured on an 
ordinal scale, we may speak in terms 
of pattern preferences (eg. “Pattern 
X is preferred to Pattern Y."), At- 
titude toward level of risk is embodied 
in the concept "utility of risk," which 
is assumed to be measurable on a ratio 
Scale. 


The PLR model 


At this stage in the development of 
the pattern and level of risk model 
(PLR model), no assumptions have 
been made about the nature of the 
relationship between utility of pattern 
and pattern itself other than the as- 
Sumption that this relationship differs 
Írom person to person. Some specula- 
tions about this question will be pre- 
sented in the discussion section. The 
function relating utility of risk to level 
of risk is also assumed to differ from 
person to person but to have in com- 
mon for all people three characteristics : 
(a) being negative for all levels of risk, 
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(b) being a negatively accelerated 
decay curve, and (c) being zero when 
the level of risk is zero. 

'The PLR model is analagous to the 
traditional models in as much as it 
provides a formula for computing the 
utility of alternatives and assumes that 
the gambler will choose the alternative 
which has the greatest utility. This 
formula may be stated as follows: 


U(X) = rx-u(px) + e(rx) [5] 
where: 


U(X) is the utility of alternative X. 

rx is the level of risk involved in X. 

u(px) is the utility of the pattern 
px of X. 

g(rx) is the utility of risk. 


Causality moves in only one direction 
in this model: U(X) is the dependent 
variable and the terms on the right of 
the equation are independent variables. 
Again it should be emphasized that the 
PLR model applies only to bets which 
have at least one negative outcome. 
The model is illustrated in Figure 
1, where U(X) is shown as a function 
of ry. An intuitive account of the 
model follows. In order to discuss 
the model intuitively, a name will be 
needed for the first term on the right 
hand side of Equation 5. This is 
called, heuristically "expectation of 
gain"; the greater this term, the more 
the gambler "expects to get out of a 
bet" The term has the following 
properties, all intuitively compelling: 
As the utility of a pattern u(px) in- 
creases, expectation of gain from bets 
embodying that pattern also increases. 
In the case of patterns whose utility 
u(py) is positive, expectation of gain 
Will increase as the amount of money 
Tisked ry increases, since for a given 
Pattern, increasing the amount risked 
also increases the amount which may 
€ won, But where the utility of a 
Pattern is negative, expectation of gain 


Illustration of the PLR model 
showing the utility of an alternative U(X) 
as a function of the level of risk involved 


Fic. 1. 


in that alternative rx. Also shown are the 
components of U(X), the expected gain 
rx'u(px) and the utility of risk g(rx). 
(The ideal level of risk, which comes at 
the point where U(X) is maximal, and the 
maximum acceptable level of risk, MALR, 
are indicated.) 


decreases as the amount of money 
risked increases. 

Expectation of gain and utility of 
risk are conceived intuitively as hy- 
pothetical constructs, having genuine 
referents in the form of two conflicting 
or cooperating behavior tendencies 
within the individual. These tend- 
encies cooperate when the utility of a 
pattern is negative. When the utility 
of pattern is positive, however, these 
tendencies come in conflict and the 
conflict increases as the level of risk 
increases. Because the utility of risk 
is a negatively accelerated decay curve 
over level of risk, the model predicts 
what we know from experience (and 
what the SEM model failed to pre- 
dict), that there is a limit to the 
amount of money a man will risk on a 
pattern which has at least one negative 
outcome, no matter how favorable this 


pattern may appear. 
Derivations from the Model 


In this section will be presented five 
propositions drawn from the PLR 
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model. All of them refer to the be- 
havior of individuals rather than 
groups. 


Proposition 1. At any level of risk, 
the order of preference among patterns 
of risk will be the same as at any other 
level of risk. 

Proof. Take any two bets X and Y 
at level of risk R, where X has pattern 
px and Y has pattern px and assume 
that X is preferred to Y, ie, U(X) 
2 U(Y). By Formula 5, this implies 
that: 


R-u(px) + g(R) > R-u(py) + g(R) 
which simplifies to: 


u(px) > u(by) 


Now take any other level of risk S and 
find the two bets X’ and Y’ which 
have the same patterns, px and py. 
Since we know that: 


u(px) > u(py) 
it follows that: 


S-u(bx) + g(S) > S-u(by) + g(S), 


which implies by Formula 5 that 
U(X") 2U(Y'), ie, that X' is pre- 
ferred to Y', Thus it is shown that 
the order of preference between any 
two patterns of risk at any level of risk 
will be the same as the order of prefer- 
ence between the same patterns at any 
other level, from which Proposition 1 
follows. 


Note. In the next four propositions, two 
new terms are introduced: the "ideal level 
of risk," which is the level of risk which 
the individual most prefers for a given 
pattern of risk, and the "maximum accept- 
able level of risk," which is the highest level 
of risk he will voluntarily accept for a 
given pattern. 


Proposition 2. The more preferred 
a pattern of risk, the higher will be the 
ideal level of risk for that pattern. 

Proof. This proposition will be 
demonstrated with reference to Figure 


Pruitt 


1. The ideal level of risk is found, of 
course, at the point where the curve 
U(X) reaches a maximum. It should 
be immediately apparent that a person 
can have only one ideal level of risk 
Since the sum of a straight line and 
a curve which is negatively accelerated 
is itself a negatively accelerated curve 
and, therefore, can have at most one 
maximum. By calculus, this maxi- 
mum occurs at the level of risk where: 


U'(X) = u(px) + g'(rs) = 0 


i.e., where: 


u(px) = — g' (rx) 
To put this in other words, the ideal 
level of risk for a pattern is found 
where the utility of that pattern equals 
the negative slope of the utility of risk 
curve, 

Now assume two patterns of risk 
P and Q such that u(P) »u(Q), ie. 
that P is more preferred than Q. Ac- 
cording to our calculus, the negative 
slope of the utility of risk curve will 
be greater at the ideal for P than at 
the ideal for Q. Since the negative 
slope of the risk curve increases with 
increasing level of risk, this means 
that the ideal for P will be found at 
a higher level of risk than the ideal 
for Q. 

Proposition 3. The more preferred 
a pattern of risk, the higher will be 
the maximum acceptable level of risk 
for the pattern, 

Proof. This proposition will also be 
demonstrated with reference to Figure 
1. It is well known that a curve which 
is negatively accelerated throughout 
can be crossed by a straight line at 
most two times. In this case, the func- 
tion U(X) of rx is the negatively ac- 
celerated curve and the level of risk 
axis (the abscissa) is the straight line. 
By definition these curves cross one 
another at the origin. The other cross- 
ing, if it exists, is the maximum ac- 
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ceptable level of risk, i.e., the point 
where U(X) =0. By algebra, the 
maximum acceptable level of risk for 
a pattern px occurs at the point where: 


u(px) = — g(rx)/rx 


ic. where the utility of this pattern 
equals the negative utility of risk di- 
vided by the level of risk. 

Now assume two patterns of risk P 
and Q such that w(P) 2» «(Q), ie. 
that P is preferred to Q. According 
to our formula, this means that the 
ratio of the negative utility of risk di- 
vided by the level of risk will be 
greater at the maximum for P than at 
the maximum for Q. Since the func- 
tion relating the utility of risk to level 
of risk is a negatively accelerated 
growth curve, this ratio will be greater 
the greater is the level of risk. There- 
fore the maximum level of risk for P 
will be higher than the maximum 
level of risk for Q. 

Proposition 4. For any given pat- 
tern of risk, divide the levels of risk 
into two groups: those smaller than 
the individual’s ideal and those greater 
than his ideal. Within each group, 
levels of risk will be more preferred 
the closer they are to the ideal level 
of risk. 

Proof. As was mentioned above 
under Proposition 2, the ideal level of 
risk occurs at the point where U(X) 
reaches a maximum. Since U(X) is 
negatively accelerated throughout, it 
can have at most one maximum and no 
minima. The proof of Proposition 4 
comes from the nature of such a 
maximum ... namely, that the curve 
rises ever higher as it approaches the 
maximum and declines ever more 
Sharply as it proceeds beyond the max- 
imum. This implies a theorem that 
?n either side of the ideal the utility 
9f bets with a given pattern increases 
75 the level of risk approaches the ideal 
"Vel of risk, Proposition 4 is a re- 


statement of this theorem in preference 
language. 

Proposition 5. For any given pat- 
tern of risk, all levels of risk up to 
the maximum acceptable level of risk 
will be acceptable. 

Proof. This proposition follows 
from the assumption, stated in the 
discussion of Proposition 3, that the 
function of U(X) of rx can cross the 
level of risk axis at most two times, 
the first of which is the origin and 
the second, if it exists, the maximum 
acceptable level of risk. Since it can- 
not cross the level of risk axis between 
these points, it remains positive be- 
tween them . . . all levels of risk up to 
the maximum level have a positive 
U(X), which means that they are all 
acceptable. 


Experimental Evidence for the Propo- 
sitions 


The experimental evidence presented 
in this section is derived from two 
experiments on gambling. 

The method and part of the results 
of the first experiment are reported 
more extensively elsewhere (Coombs 
& Pruitt, 1960). The procedure con- 
sisted of presenting a large number of 
forced choices between two-outcome 
bets, such as the following: 


1/3 to win $1.40 
2/3 to lose 70 cents 
1/2 to win $1.00 
Bet B: 1/2 to lose $1.00 


Bet A: 


All bets had an expected value (EV) 
of zero. The bets were organized 
into five sets. Those in Sets I, II, and 
III had a constant probability of win- 
ning and differed in variance? The 


3 The variance of bets of this type is 
computed by the formula o=pq(a—b)*, 
where p and q are the probabilities of win- 
ning and losing respectively, and a and b 
are the amounts to be won and lost. 
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probability of winning for Set I was 
1/3; for Set II, 1/2; and for Set Ill, 
2/3. There were six levels of vari- 
ance in each set: .12, .30, 1.0, 5.0, 25.0, 
and 100.0. The bets of Sets IV and V 
had a constant variance and differed 
in probability of winning. Set IV had 
a variance of 1.0 and Set V a variance 
of 25.0. The same levels of probability 
were used in both sets: 1/6, 1/3, 1/2, 
2/3, and 5/6. All of the bets within 
each set were paired with one another 
and each of the resulting pairs was 
presented eight times.  Ninety-nine 
undergraduates were used as subjects. 

The second experiment was similar 
to the first in most respects but had 
six sets. Again the bets in Sets A, 
B, and C had a constant probability of 
winning and differed in variance. Set 
A had a probability of 1/3, Set B of 
1/2, and Set C of 2/3. Seven levels 
of variance were employed in each set: 
-12, 2.0, 5.1, 12.5, 21.2, 32.0, and 45.2. 
The bets in Sets D, E, and F had a 
constant variance and differed in the 
probability of winning. Set D had a 
variance of .12, Set E of 12.5, and 
Set F of 45.2. In each set there were 
three levels of probability: 1/3, 1/2, 
and 2/3. As in the case of the first 
experiment, the bets were exhaustively 
paired within Sets D, E, and F, and 
each pair was presented eight times. 
However, in the case of Sets A, B, 
and C, an accept-reject method was 
used instead of paired comparisons ; 
each bet was presented eight times 
and the subject was asked in each case 
to decide whether or not to accept the 
bet. Thirty-nine undergraduates were 
used as subjects. 

Before discussing the results, three 
things must be said about the relation- 
ship between these experimental oper- 
ations and the terms of the PLR 
model. (a) It can be easily. shown 
het tar n : pee 

ro, the pattern o om- 
Sicily determined by the probability 


of winning. Thus, the five probabili- 
ties of winning used in these experi- 
ments correspond to five patterns of 
risk: 1/6, 1/3, 1/2, 2/3, and 5/6. 
(b) Within the range of probabilities 
used in these experiments, two bets 
with different probabilities of winning 
(different patterns, that is) and the 
same variance will have roughly the 
same level of risk. And level of risk 
is monotonicly related to variance. 
Thus, the six levels of variance in the 
first experiment correspond roughly 
to six levels of risk, and the seven 
levels of variance in the second ex- 
periment to seven levels of risk. (c) 
the design of these experiments per- 
mits us to make three kinds of anal- 
ysis: In the case of stimulus Sets I, 
II, III, A, B, and C, we can study 
preferences among levels of risk while 
holding the pattern of risk constant. 
In the case of Sets IV, V, D, E, and 
F, we can study pattern preferences 
while holding level of risk constant. 
Finally, for each subject, we can com- 
pare his preferences among levels of 
risk with his pattern preferences to 
discover interactions. 

Proposition 1. At any level of risk, 
the order of preference among pat- 
terns of risk will be the same as at any 
other level of risk. 

The responses to Sets IV and V in 
the first experiment provide us an 
opportunity to compare each subject's 
order of preference among the five 
patterns (1/6 to 5/6) at a low level 
of risk (variance of 1.0) with his order 
of preference among the same five pat- 
terns at a high level of risk (variance 
of 25.0). Proposition 1 implies that 
these two preference orderings will be 
identical. 

Preference orderings were obtained 
from the data by assuming that subject 
preferred Bet A to Bet B if he chose 
A on more than half of the occasions 
when it was paired with B. Inspec- 
tion of the results revealed striking 
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- similarity between the orders of pref- 


erence at the two levels of risk, as 
predicted; but it is necessary to ex- 
press this similarity in a quantitative 
fashion. An index was therefore de- 
vised for expressing these preference 
orderings, which ranged from a low of 
1, for a pattern of consistent preference 
in all pairs for the smaller probability 
of winning, to a high of 11, for a 
pattern of consistent preference for 
the larger probability of winning. 
Two such ratings were computed for 
each subject, one to describe his pat- 
tern preferences in Set IV, the other 
to describe his preferences in Set V. 
The correlation, over subjects, be- 
tween these two ratings was .93, a 
highly significant result, indicating con- 
siderable stability in the order of pref- 
erence among patterns as one moves 
from a low to a high level of risk. 

Similar results were found in the 
second experiment for Sets D, E, 
and F. 

Proposition 2. The more preferred 
a pattern of risk, the higher will be 
the ideal level of risk for that pattern. 

Since all of the levels of risk within 
a set were exhaustively paired, it was 
possible to identify for each of the 
Sets, I, II, and III of the first experi- 
ment that level of risk which was 
preferred over all the others, i.e., the 
ideal level of risk. This meant than 
an ideal level of risk was identified 
for three patterns: 1/3 (Set I), 1/2 
(Set II), and 2/3 (Set IIT). An 
index was devised for expressing these 
ideals, which ranged from 1, indicating 
a preference for the lowest level of 
risk, to 16, indicating a preference for 
the highest available risk. (Since there 
exist levels of variance below .12 and 
above 100.0, this scale has an arbitrary 
Seiling and floor.) Preferences be- 
Ween the three patterns were ascer- 


&u, This and subsequent indices are thor- 
(Guy described in the previous report 


mbs & Pruitt, 1960). 


tained from the data of Sets IV and V 
combined. 

Proposition 2 was tested in the fol- 
lowing form: If one pattern is pre- 
ferred to another, there will be a 
higher ideal level of risk for the former 
than for the latter. Data supporting 
this hypothesis are shown in Table 1. 

In Table 1a the subjects are divided 
into two groups (columns) on the 
basis of which pattern they preferred, 
1/3 or 1/2. (Two subjects are omitted 
who did not have a preference between 
the two patterns.) Each of these 
two groups is divided into three sub- 
groups (rows) depending on whether 
the ideal level of risk for Pattern 1/3 
was greater than, equal to, or less than 
the ideal for Pattern 1/2. The results 
show a strong tendency for the more 
preferred pattern to have a higher 
ideal level of risk, as hypothesized. 
x? = 23.13 for the contingency table, 
with p<.01. The relatively large 
number of cases in which both ideals 
were equal might be thought to weaken 
this hypothesis. However, for most 
of them (25 out of 30), the most 
preferred level of risk for both pat- 
terns was at one or the other of the 
extreme ends of the range of levels of 
risk used in this experiment and there- 
fore did not necessarily reflect the 
true ideal. Had the range of values 
been greater, most of these cases 
would undoubtedly not have exhibited 
equal ideals. The degree to which 
these data support the hypothesis can 
perhaps best be summarized in terms 
of the proportion of cases correctly 
predicted: In 82% of the cases where 
there was a difference in ideals, the 
direction of this difference could be 
predicted by knowing which pattern 
was preferred. 

Table 1b shows the same sort of 
results for the comparison between 
Pattern 1/3 and Pattern 2/3. Again 
the hypothesis is strongly supported 
(2 = 35.77, p< 01), with correct 
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TABLE 1 


CONTINGENCY TABLES SHOWING RELATION 
BETWEEN PATTERN PREFERENCE AND 
IDEAL LEVEL oF RISK 


Pattern preferred 


Ideal 
i i Total 
Ideal for 3 greater than | 15 4 19 
for à 
Ideal for } equal to that | 13 17 30 
for à 


Ideal for 3 less than 8 40 48 
for 4 


Total 36 61 
a. 3vs. 3 
Pattern preferred 
Ideal —————— 
i i Total 
Ideal for 3 greater than | 29 4 33 
than $ 
Ideal for 4 equal to that | 12 6 18 
for $ 
Ideal for 3 less than 8 33 41 
for $ 
Total 49 43 
b. 3 vs. 3 
Pattern preferred 
Ideal [———— M 
i i Total 
Ideal for } greater than| 34 5 39 
for $ » 
Ideal for } equal to that} 12 18 30 
for $ 
Ideal for à less than} 12 15 27 
for $ 
Total 58 | 38 


c. $ vs. $ 


Note.—Entries are number of subjects. 


prediction in 84% of the cases where 
there was a difference in ideals. Six- 
teen of the 18 equal cases were at an 
extreme end of the range of levels of 
risk. Table 1c, for the Patterns 1/2 
and 2/3, also supports the hypothesis 
(x? = 19.65, p «.01), with correct 
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prediction in 74% of the cases where 
there was a difference in ideals. All 
30 equal cases were at an extreme. 

Proposition 3. The more preferred 
a pattern of risk, the higher will be 
the maximum acceptable level of risk 
for that pattern, 

Data concerning the maximum ac- 
ceptable level of risk was available 
from the responses to Sets A, B, and 
C of the second experiment, where the 
subjects were asked to accept or reject 
any or all of seven levels of risk for 
three patterns, 1/3, 1/2, and 2/3. A 
level of risk was assumed to be accep- 
able to the subject if he accepted it 
at least five out of eight times it was 
presented to him. An index was de- 
rived for each pattern which ranged 
from 0 (no level of risk acceptable) 
to 7 (highest risk Offered was the 
highest acceptable level). Preferences 
between the three patterns were ascer- 
tained in the usual way from the data 
of sets D, E, and F combined. 

Proposition 3 was tested in the fol- 
lowing form: If one pattern of risk is 
preferred to another, the maximum 
level of risk acceptable for the former 
will be greater than that acceptable 
for the latter. Data supporting this 
hypothesis are shown in Table 2. 

In all three subtables there is a 
strong association between preference 
for a pattern and willingness to accept 
a higher maximum level of risk at 
that pattern, as hypothesized. (For 
Table 2a, x? = 14.12, P< .01; for 
Table 2b, y*— 1736, b «01; for 
Table 2c, y? = 16.97, p < 01.) Again 
the relatively large number of cases in 
which the maximum acceptable levels 
of risk for both patterns are equal 
do not necessarily weaken the hypo- 
thesis since most of them occur at 
the extremes of the scale (12 out of 
17 in Table 2a; 7 out of 10 in Table 
2b; and 13 out of 14 in Table 26). 

Proposition 4. For any given pat- 
tern of risk, divide the levels of risk 
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into two groups: those smaller than 
the individual's ideal and those greater 
than his ideal. Within each group, 
levels of risk will be more preferred 
the closer they are to the ideal level 
of risk. 

Consider the data from Sets I, II, 


TABLE 2 


CONTINGENCY TABLES SHOWING RELATION 
BETWEEN PATTERN PREFERENCE AND 
Maximum ACCEPTABLE LEVEL OF 
Risk (MALR) 


Pattern preferred 
MALR 


i 4 Total 


MALR for } 
than for $ 
MALR for } equal to] 7 10 17 
that for $ 
MALR for } less than 0 14 14 
for à 


greater 6 2 8 


Total 13 26 


a. 4 vs. ] 


Pattern preferred 
MALR 
Total 


MALR for } 
than for 3 
MALR for } equal to 7 8 10 
that for $ 
MALR for } less than 1 13 14 
for $ 


greater| 12 3 15 


Total 20 19 


Pattern preferred 
MALR 


i 1 Total 


MALR for à 
than for $ 
MALR for 3 equal to} 4 | 10 | 14 
that for $ 
MALR for 2 less than 1 9 10 
for 3 


greater| 13 2 15 


Toi 18 | 21 


c. $ vs. $ 


Note Entries are number of subjects. 


and III of the first experiment. Prop- 
osition 4 implies that within each set, 
levels of risk above a subject's ideal 
will be preferred in order of ascending 
magnitude whereas levels of risk be- 
low his ideal will be preferred in order 
of descending magnitude, i.e., that the 
preference orderings will be folded J 
scales, in the terminology of the first 
report (Coombs & Pruitt, 1960). As 
shown in the first report, the results 
strongly supported this hypothesis: In 
Set I, out of 92 nonrandom orderings 
of preference among levels of risk, 81 
were of the postulated type. In Set II, 
89 out of 96 were of this type. In 
Set III, 83 out of 93 were of this type. 

Proposition 5. For any given pat- 
tern of risk, all levels of risk up to 
the maximum acceptable level of risk 
will be acceptable. 

Proposition 5 can easily be tested 
by an examination of the data from 
Sets A, B, and C of the second experi- 
ment. Within each set, the maximum 
acceptable level of risk is the highest 
level of risk which was accepted at 
least five out of eight times it was 
presented to the subject. The hypo- 
thesis is that within each set all levels 
of risk smaller than this maximum will 
also be accepted at least five out of the 
eight times they were presented. The 
hypothesis was confirmed in 110 of 
the 117 possible sets of data (3 sets 
times 39 subjects) ; which is, of course, 
exceedingly strong evidence for the 
proposition. 


Discussion 
Comparison with Other Models 


The four traditional models. The 
question naturally arises: Can the ex- 
perimental results which support the 
PLR model be predicted or explained 
by one or more of the four traditional 
models? If so, one might argue that 
there is no need for a new model. 

Two of these models yield clearcut 
predictions which are so grossly wrong 
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that the models can easily be rejected 
in the context of these data. The EV 
model is obviously inadequate because 
it predicts indifference between the 
bets of these experiments (which all 
have an EV of zero) whereas in 
reality consistent preference was the 
rule. The SEM model is also obvi- 
ously inadequate because, as mentioned 
earlier, it predicts that if a man likes 
a pattern at all he will insist on choos- 
ing the highest level of risk he can 
obtain for this pattern. Many sub- 
jects in the first experiment preferred 
moderate levels of risk, so that this 
prediction was not borne out. 

The other two models are less easy 
to disprove because their predictions 
are not clearcut unless utility of money 
is carefully measured, Though such 
measurement was not attempted in 
these experiments, it seems reasonable 
to reject the models on another basis, 
namely that there is nothing in either 
the EU or SEU models, as usually 
stated, which would lead one to pre- 
dict any of the regularities of behavior 
described in the five propositions and 
supported by the findings of these ex- 
periments. Perhaps, in the manner 
of the Procrustrean Bed, one of the 
traditional models could be stretched, 
new parameters added or more com- 
plex utility curves drawn to encompass 
these results. But the beauty of theory 
is its simplicity; so that it seems more 
reasonable to make striking new regu- 
larities of behavior such as these the 
basis of a new model. 

Again it must be stressed that the 
traditional models should not be com- 
pletely discarded; if possible, some 
common ground should be found be- 
tween the new model and what is good 
about the old ones. In the first ped 
it was shown that the model ys 

best fit the segment of data to — 
it was applied was the SEM ker. 
(Edwards, 1955). A way in whic 


Pruitt 


this model can be incorporated into 
the PLR model without increasing the 
complexity of the latter will be shown 
in the section on research implications. 

Probability and variance prefer- 
ences. The PLR does not cast aside 
the concepts of probability and vari- 
ance preference (Coombs & Pruitt, 
1960; Edwards, 1953) but rather pro- 
vides a larger framework within which 
they can be understood and related to 
one another. More specifically, the 
notion of “probability preference” is 
generalized in the concept "pattern 
preference," which embraces attitudes 
toward both probability and ratios of 
Outcomes. The earlier finding (Ed- 
wards, 1954b) that probability pref- 
erences are unaffected by changes in 
moncy level is generalized into Propo- 
sition 1. The notion of “variance 
preference” js translated in the con- 
cept “ideal level of risk": and the 
hypothesis that variance preferences 
fit a folded J scale of variance, proposed 
by Coombs and the author (1960), is 
expressed in Proposition 4, which is 
mainly dependent on the negatively ac- 
celerated character of the utility of 
risk curve. In addition, the PLR 
model goes a step beyond these earlier 
formulations by showing, in Proposi- 
tion 2, how probability and variance 
preferences are related to one another 
and permitting predictions concerning 
the maximum acceptable level of risk 
in Propositions 3 and Ey 


Research Implications 


One of the important features of 
the PLR model is the implication of 
Proposition 1 that pattern preferences 
can be studied independently of level 
of risk. If true, this greatly simplifies 
Tesearch on gambling by permitting 
study of one dimension at a time; 
but this assertion must itself be further 
examined in the context of a wider 
range of patterns. 
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In studying pattern preferences, one 
might ask the question, What are the 
dimensions of pattern which determine 
preferences among them? Two di- 
mensions which are suggested by the 
Structure of simple wagers (e.g., “I'll 
give you 5 to 3 on the Cubs”) are the 
subjective probabilities of outcomes 
and the ratios of objective outcomes to 
one another. For example, the utility 
of pattern in the PLR model could be 


, expressed by a subjectively expected 


ratio (SER) model, which would be 
quite similar to the SEM model and 
would, therefore, hopefully retain its 
best features (prediction of choices 
between bets differing in pattern when 
level of risk is held constant). The 
formula for computing the utility of 
pattern would be: 


u(px) = SER = zPe L6] 
where: 


t; equals |$;|/Z |$;], 


the ratio of the (absolute) amount of 
money in the ith outcome to the sum 
of the (absolute) amounts of money 
in all of the outcomes of the bet. 

One serious deficiency of the PLR 
model is its failure to predict to bets 
with no negative outcomes, where ry 
equals zero. There is a very simple 
modification which would make it 
universally applicable (and satisfy. 
the “sure thing" principle for bets 
with no negative outcomes), namely 
to add another objective parameter 
mx = E|$;| and rewrite the basic 

i 


formula: 


U(X) = mx-u(px) + g(rx) [7] 


Since my is a monotonic, positive func- 
um of ry for any pattern involving 
ue Propositions 2 through 5 are 

Altered by this procedure. Proposi- 


ti 3 
IOn | would have to be revised but 


not seriously. Since it has broader 
application, Formula 7 might have 
been used throughout this paper in- 
stead of Formula 5. However the 
latter is simpler and easier to present, 
and therefore was chosen. 


REFERENCES 


ArLAIs M. Le comportement de l'homme 
rationnel devant le risque. Econometrica, 
1953, 21, 503-546. 

Bernoutut, D. Exposition of a new theory 
on the measurement of risk. Econometrica, 
1954, 22, 23-36. 

Coowss, C. H., & Komorira, S. Measuring 
utility of money through decisions. Amer. 
J. Psychol., 1958, 71, 383-389. 

Coowxss, C. H., & Pnurrr, D. G. Com- 
ponents of risk in decision making: 
Probability and variance preferences. J. 
exp. Psychol., 1960, 60, 265-277. 

Davinson, D., Surres, P., & Srecet, S. De- 
cision making, an experimental approach. 
Stanford, Calif.: Stanford Univer. Press, 
1957. 

Epwarps, W. Probability preferences in 
gambling. Amer. J. Psychol., 1953, 66, 
349-364. 

Epwarps, W. Probability preferences 
among bets with differing expected values. 
Amer. J. Psychol., 1954, 67, 56-67. (a) 

Epwarps, W. The reliability of probability 
preferences. Amer. J. Psychol, 1954, 
67, 68-95. (b) 

Epwarps, W. Variance preferences in 
gambling. Amer. J. Psychol, 1954, 67, 
441-452. (c) 

Epwarps, W. 
among bets. 
201-214. 

Epwarps, W. Behavioral decision theory. 
Annu. Rev. Psychol., 1961, 12, 473-498. 

Hurst, P. M, & SrEGEL, S. Prediction 
of decisions from a higher ordered metric 
scale of utility. J. exp. Psychol, 1956, 
52, 138-144. 

MosrELLER, F., & Nocere, P. An experi- 
mental measurement of utility. J. polit. 
Econ., 1951, 59, 371-404. 

Snyper, G. Deterrence and power. J. 
conflict Resolut., 1960, 4, 163-178. 

Surres, P., & Wars, K. A non-linear 
model for the experimental measurement 
of utility. Behav. Sci., 1959, 3, 204-211. 

Tanner, W. P. Jr, & Swerts, J. A. A 
decision-making theory of visual detection. 
Psychol. Rev., 1954, 61, 401-409. 


(Received September 12, 1960) 


The prediction of decisions 
J. exp. Psychol, 1955, 50, 


hological Review 
Dava 69, No. 3, 202-219 


DISCRIMINATIVE CLASSICAL CONDITIONING IN DOGS 
PARALYZED BY CURARE CAN LATER CONTROL 
DISCRIMINATIVE AVOIDANCE RESPONSES 
IN THE NORMAL STATE: 


RICHARD L. SOLOMON AND 


University of Pennsylvania 


The use of curare drugs to immo- 
bilize subjects in psychological ex- 
periments has special advantages for 
research on learning, transfer of train- 
ing, and thinking. Subjects cannot 
give overt, skeletal responses, and 
therefore they cannot modify the ex- 
teroceptive or proprioceptive stimulus 
patterns that the experimenter wishes 
to control. 

We shall show that a discriminative, 
classical conditioning procedure carried 
out on curarized dogs can later control 
instrumental avoidance responses in the 
normal state. We shall describe an ex- 
periment in which dogs were first 
trained to avoid shock by pressing a 
panel whenever a signal light (S?) went 
out. The panel-press restored the light. 
Then, when the avoidance response was 
stable, the dogs were totally curarized. 
Under curare the dogs were presented 
on some trials with a tone (S*) fol- 
lowed 10 seconds later by a shock of 5 
seconds duration. A contrasting tone 
(S-) was presented on other trials, 
never followed by shock. A few days 
later, after complete recovery from cu- 
rare, the dogs were tested for avoidance 


1 This research was facilitated by the Lab- 
oratory of Social Relations, Harvard Uni- 
versity. Pilot studies over the past seven 
years were supported by grants from the 
Medical Sciences Division of the Rockefeller 
Foundation and a Ford Foundation Grant-in- 
Aid. The experiment in this paper was car- 
ried out with the support of a National 
Science Foundation Grant ce ae? 
We are grateful to P. D. Watson, A. H. 
Black, N. J. Carlson, and M. R "eeu dor 
their assistance during many pilot studies. 
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responding to S*, S", and S°, All dogs 
then showed the differentiating effects 


of the classical, discriminative condi- . 


tioning experience. The avoidance re- 
sponse to S* was usually of the same 
order of magnitude as that to S°, while 
avoidance Tesponding to S- was weak, 
absent, or quick to extinguish. We do 
not believe that precise discriminative 
transfer of training—from the classical 
conditioning of reflexes to the in- 
strumental learning of responses—has 
heretofore been demonstrated in this 
manner.? 

It is of considerable theoretical im- 
portance, as well as of empirical inter- 
est, to know that skeletal avoidance 
Tesponses acquired in the normal state 
can be controlled by classical condition- 
ing procedures carried out under cu- 
Tare. Some current versions of S-R 
reinforcement theory emphasize the 
importance of differential peripheral, 
Skeletal responses and their correlated 
proprioceptive cues in the development 
of discriminative instrumental acts. 


2? The general objective of our experiment 
certainly is not original, see Estes (1943, 
1948). In these experiments rats were sub- 
jected to Presentations of a tone followed by 
food, and then they were later trained to 
Press a lever for food reinforcement. Dur- 
ing a test period the rate of lever pressing 
Was increased by presentation of the tone al- 
though the lever pressing response had not 
previously been associated with the tone, 

he major difference between Estes' experi- 
ments and ours lies in our use of the curare 
paralysis during discriminative conditioning 
so as to eliminate the possibility of response 
generalization from the conditioning situation 
to the instrumental learning situation. 


| 
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Discriminative learning is hypothesized 
to take place by the action of differen- 
tial reinforcement on two different re- 
sponses to two different stimuli. 
Because we shall show that discrimi- 
native instrumental acts can be estab- 
lished without those acts having been 
reinforced differentially (because they 
could not occur under curare), then a 
revision of theories emphasizing the 
Importance of peripheral, skeletal re- 
sponding and proprioceptive cues seems 
definitely to be required. Such a re- 
vision was once the goal of "latent 
learning" studies. But they failed to 
give clearcut evidence because the S-R 
reinforcement theorist who emphasized 
peripheral response components could 
always claim: (a) that overt responses 
were occurring during the latent learn- 
ing period, (b) that some kind of rein- 
forcement was apt to occur following 
those responses, and (c) that those re- 
Sponses were similar in some way to 
the final required performances. These 
were cogent objections. They can be 
effectively circumvented, however, by 
the use of the curare-conditioning prep- 
aration. Stimulus sequences presented 
to the totally curarized subject cannot 
be accompanied by differential skeletal 
responding. In addition, skeletal re- 
Sponding cannot occur in the presence 
of differential reinforcement. There 
are no overt acts for the experimenter 
to reward or punish. Thus the carry- 
over or transfer of "information" ac- 
quired under curare so as to influence 
skeletal responses in the normal state 
could be mediated peripherally only by 
differential ANS responding or “anxi- 
ety." Whether or not such a transfer 
phenomenon would be considered to be 
a function of "pure" central mediation 
or latent learning, or whether the me- 
diation would be considered to be a 
function of anxiety, would, of course, 
depend on one’s theoretical biases. A 
Position emphasizing the mediation of 


learning and transfer by proprioceptive 
or kinesthetic cues suffers most in the 
light of our findings. 


History oF PROBLEM 


The theoretical issue to which our experi- 
ment is addressed is certainly an old one. In 
the days when psychology was dealing with 
mental content, there were those who main- 
tained that the response processes of the 
skeletal musculature were necessary for per- 
ception and thought. In 1887, F. M. Mueller 
declared that we cannot think without words. 
This position was espoused by Watson and 
by Max when they identified implicit lin- 
guistic responses with thought processes. 
Early behaviorism was peripheralistic in na- 
ture. But so were the systematic approaches 
of many students of consciousness. William 
James emphasized peripheral response events 
in the production of emotional feelings. E. 
Jacobson maintained that awareness itself 
was a by-product of motor reactions. For 
example, he argued that muscular relaxation 
was accompanied by the disappearance of 
imagery and thinking. 

While such peripheralistic interpretations 
of mental events were useful to the early 
behaviorists, the introspective methods used 
to arrive at the so-called “motor theory of 
consciousness" were shunned by them, At 
the hands of behaviorists, precise laboratory 
experimentation, aimed at the study of overt 
behavior as a function of stimulation con- 
ditions, replaced introspection, But the 
major idea of the “motor theory of con- 
sciousness—that the important mental proc- 
esses such as perception, feeling, thinking, 
and imagining were caused by peripheral 
motor reactions—was retained by the periph- 
eralistic behaviorists in new form. Complex 
behavioral phenomena, difficult to place in the 
simple S-R formula, were interpreted in a 
peripheralistic way. Thus, animals did not 
think, but instead they showed transfer of 
training mediated by chains or organizations 
of “response-produced cues.” Animals did not 
“expect,” but instead they maintained their 
future-orientations by virtue of response-pro- 
duced cues which served to bridge time. Im- 
plicit anticipatory goal responses became a 
major conceptual tool of the Hullians, while 
differential proprioceptive and kinesthetic 
cues carried the Skinnerian subjects over 
their toughest problem solving challenges. 
Of course, many of the old theoretical contro- 
versies revolved around the validity of these 
peripheralistic positions. i 

Harlow and Stagner (1933), following a 
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suggestion from Donald G. Marquis, were 
the first to see the importance of a paralyzing 
drug in hitting at the crux of the peripheral- 
ist-centralist controversies, whether the con- 
troversies concerned the motor theory of 
consciousness or the latent learning concept 
and its challenge to peripheralistic learning 
theory. These experimenters argued that, if 
the motor theory of consciousness were cor- 
rect, conditioning in paralyzed animals should 
not transfer to the normal state, since the 
conditioning experience would be ineffective, 
They further pointed out that if discrimina- 
tion learning were a function of the differen- 
tial reinforcement of overt skeletal acts, then 
conditioning in paralyzed animals should not 
transfer to the normal state, since there 
would be no differential responses in the 
paralyzed state. 
Harlow and Stagner used the drug curare 
in their experiments. Curare produced a 
complete flaccid paralysis in animal subjects. 
Its action was at that time thought to consist 
mainly of chemical blocking at the junction 
of the motor axon and muscle cell. So cu- 
rare could remove skeletal motor responses 
and their proprioceptive feedback and create 
a preparation in which both the motor theory 
of consciousness and the theory of latent 
learning could be tested. Therefore, the ex- 
periments by Harlow and Stagner (1933) 
were considered to be crucial. Harlow and 
Stagner studied both instrumental avoidance 
learning and pupillary conditioning in cura- 
rized cats and dogs. In one experiment, they 
first pretested subjects for aversion to the fol- 
lowing stimuli in the experimental room: a 
chair, bell, light, and buzzer. Then the sub- 
jects were put on a metallic grid, and the 
latency of jumping off it, in response to the 
various pretested stimuli, was measured, 
Finally, all subjects were curarized to the 
stage where there was “absence of muscle 
twitches, gag reflex, and the corneal eye-lid 
reflex” (Harlow & Stagner, 1933, p. 287). 
Then followed 30 pairings of conditioned 
stimulus and electric shock to the grid on 
which the subjects had been placed. Under 
such treatment a normal, undrugged control 
group learned to jump off the grid in re- 
sponse to the conditioned stimulus (CS) in 
three trials or less. When the curarized 
subjects had recovered from the effects of the 
drug, they were reintroduced to the grid, 
presented with the CS, but showed not the 
slightest evidence of learning. 
In a second experiment, Harlow oe 
in sequence to curariz ub- 
e vx wi bell, zd a shock. The pupil 
eae council to the light, dilated to the 
eh but showed no change to the bell. But 


after two to six presentations of the three- 
stimulus sequence to curarized subjects, the 
pupillary dilation, originally elicited by shock, 
became elicitable by the bell alone. Said the 
experimenters (Harlow & Stagner, 1933): 


It thus appears certain that cortical activ- 
ity (to the extent required for the acquisi- 
tion of the conditioned pupillary response) 
is not depressed by curare, at least in the 
amounts used in this experiment (p. 289). 


Then two extinction experiments were car- 
ried out. Three cats were trained to leap 
from the electrified grid in response to a CS. 
Later, when curarized and placed on the grid, 
they were presented with the CS alone. 
When the subjects recovered from curariza- 
tion, and were placed on the grid again, 
they leaped from the grid with short re- 
sponse latencies. In another group, pupillary 
dilation was conditioned while the subjects 
were in the normal state. Then, while cura- 
rized, the CS was presented from 100 to 150 
times without the shock, and the conditioned 
dilation disappeared. Yet skeletal avoidance 
responses were later shown to have remained 
intact in these same subjects. Harlow and 
Stagner felt that they had established the 
fact that adaptive behavior is not learned 
under curare, However, they asked whether 
it might be possible that CS-US pairings un- 
der curare might have central effects which 
aren't organized in such a Way so as to pro- 
duce à specific response, A savings test 
might reveal this, Therefore, two control 
Puppies were given 30 CS-US sequences 
under curare, then trained again later in the 
normal state, They were compared to two 
puppies which were trained only in the nor- 
mal state. There were no differences be- 
tween the two pairs of Subjects in either 
acquisition or extinction rates. In another 
experiment, conditioned forelimb flexion was 
used (more limited and segmental response 
than jumping off the grid), but it revealed 
no learning under curare. The conclusion 
reached by Harlow and Stagner (1933) was 
this; "ijt appears that presentation of 
stimuli alone wil] not cause learning if no 
reaction is made” (p. 293). The conclusion 
hit hard at latent learning theory. 

In direct contradiction to the opinion 
of Harlow and Stagner, Light and Gantt 
(1936), using a different method, concluded 
that “. | the peripheral nerve and executor 
organ are not necessary for conditioned 
reflex formation" (p. 35). In four dogs the 
right hind leg was paralyzed by crushing the 
anterior nerve roots. Before regeneration of 
the injured nerves, simple conditioning was 
carried out on the paralyzed side. A CS was 
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paired with shock to the immobile leg; when 
a diffuse conditioned response (CR), con- 
sisting of howling and struggling, was estab- 
lished, training was stopped. Later, after 
regeneration of the damaged motor nerves, 
the CS was presented, (Light & Gantt, 1936) 
“and it was followed by withdrawal of the 
formerly paralyzed leg—the appropriate and 
specific conditioned movement, but one which 
was never possible during the period of train- 
ing” (p. 35). At the time it was certainly 
possible that the Harlow and Stagner experi- 
ment was not inconsistent with this result. 
After all, Light and Gantt had used a class 
cal conditioning procedure; and this pro- 
cedure had produced pupillary conditioning 
in the immobilized state for Harlow and 
Stagner. (The psycho-pharmacological prop- 
erties of the curare used by Harlow and 
Stagner had not yet come under scrutiny. ) 
Then, too, the Crisler (1930) findings had 
indicated that classical conditioning could 
occur in the case of a blocked salivary re- 
action. Perhaps the distinction that would 
resolve the discrepancy was that between 
classical conditioning and instrumental train- 
ing. Perhaps the conclusion of Harlow and 
Stanger applied only to the latter procedure? 

Settlage (1936), using a Pavlovian delayed 
conditioning procedure (CS-US interval of 
2 seconds), attempted to condition 12 cats 
under sodium amytal. The drug level was 
adjusted so that when the CS was presented, 
the subjects gave no anticipatory limb with- 
drawals. Then conditioning pairings of CS 
and shock were given, ranging from 10 to 50 
in number from stimulus to stimulus. Later, 
in the normal state the CS was presented 
alone. A high percentage of limb with- 
drawals was noted. Settlage felt that a new 
S-R connection was established under sodium 
amytal. However, this finding was not 
critical for a peripheral-motor learning po- 
sition because, while under amytal the un- 
conditioned stimulus (US) was often capable 
of eliciting a limb withdrawal, even though 
the CS was not. It was, however, an inter- 
esting and instructive finding. Here was a 
case where the organism was in a state such 
that an unconditioned response (UR) could 
be elicited yet no CR could emerge; however, 
the CR was later shown to have developed 
normally under sodium amytal immobiliza- 
tion, since it occurred when the CS was 
Dresented to the animals after they had re- 
Covered from drug effects. 

The psycho-pharmocological complications 
9f the use of curare were now to come under 
mutiny in a puzzling series of experiments, 
fra” by Girden and his collaborators. The 

St of these (Girden & Culler, 1937) demon- 
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strated certain disturbing side effects of the 
drug on the function of the brain. The semi- 
tendinosus muscle of the dog was dissected 
out, leaving its circulation and innervation 
intact. Then the subject was presented with a 
2-second bell followed by a 0.5-second shock 
to the paw of the prepared leg while the sub- 
ject was totally curarized. However, the US 
did elicit a twitch of the semitendinosus 
muscle, even though it did not produce a limb 
movement. A good semitendinosus muscle 
twitch CR developed in response to presenta- 
tion of the CS under curare. Yet this CR 
vanished when the dogs were allowed to re- 
cover from the effects of curare. The CR 
reappeared when the dogs were curarized 
again. When a group of dogs was condi- 
tioned in the normal state, then tested under 
curare, there was no detectable CR of the 
semitendinosus muscle. Yet the CR reap- 
peared when the effects of curare wore off. 
There thus was a lack of transferability of 
conditioning from one state to the other. 
Girden and Culler (1937) attributed the find- 
ings to a separation of cortical and sub- 
cortical conditioning processes; they said: 


It is thus conceivable that under curare the 
normal cortical dominance is inhibited, and 
that conditioning therefore occurs at sub- 
cortical levels (thalamus). When the ani- 
mal revives, the cortex again functions 
normally and the (conditioned) thalamic 
activities are inhibited. Likewise the CR 
established in the normal animal (with 
participation of the cortex) is inhibited 
under curare (due to general inhibition of 
the cortex) (p. 272). 


Whether or not this interpretation would 
endure, it was, however, certain that the 
curare used in these experiments produced 
complicated central nervous system effects ; it 
was not merely a peripheral-motor blocking 
agent. 

Harlow and Settlage (1939) designed an 
experiment to detect the effect of curarization 
of the upper body upon the ability of cats to 
display previously learned responses of the 
lower body. The subjects’ aortas were pre- 
pared so that they could be tied off in the 
abdominal region. Then the subjects were 
conditioned to give vigorous hind leg with- 
drawals, using a shock as US, and a bell as 
CS. After conditioning, the aortas were 
clamped off and curare was injected into the 
upper body. The upper body parts became 
paralyzed, while the lower body parts could 
move until they became cyanotic due to the 
occluded blood supply. Vigorous CRs per- 
sisted for short periods, perhaps no longer 
than 3 minutes after curarization. It was 
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not possible to tell whether the disappear- 
ance of CRs was due to collateral circulation 
of curare to the legs or to lack of adequate 
blood supply. The experimenters felt that 
they had demonstrated that curare does not 
immediately affect the CNS, at least not for 
3 minutes following injection. 

Culler, Coakley, Shurrager, and Ades 
(1939) bared the spinal roots and the cere- 
bral cortices of nine dogs. They located the 
spinal root locus and cerebral cortex locus 
for electrical elicitation of twitches of the 
semitendinosus muscle. Then they curarized 
the dogs and measured the stimulation inten- 
sities necessary to produce a constant magni- 
tude of excursion of the semitendinosus 
muscle. They found that when they com- 
pared these stimulation intensities for cura- 
rized compared to normal dogs, a striking 
result emerged. Curare elevated the excita- 
tion time as well as the rheobase for the 
cortex, while there was almost no effect, and 
perhaps the opposite, at the ventral root. 
'The experimenters felt that somewhere be- 
tween the cortex and the ventral spinal root 
lies a plane of neural cleavage above which 
curare is a depressant. While this might 
explain the findings of Girden and Culler 
(1937), it still did not agree accurately with 
the observations of Harlow and Stagner 
(1933) on the successful development of 
pupillary conditioning under curare, unless 
one would assume further that pupillary con- 
ditioning occurs below the postulated plane 
of neural cleavage. 

Harlow accepted the conclusion that cu- 
rare was a CNS depressant. However, he 
felt that if dosage could be adequately ad- 
justed, one might produce sufficient immo- 
bilization without severe CNS side effects. 
Therefore, in an extensive series of experi- 
ments, he employed what he called “incom- 
plete curarization." Using 11 cats that were 
incompletely paralyzed, he established con- 
ditioned escape responses (with a 4-second 
CS and 1-second shock) with some difficulty. 
In contrast, conditioned pupillary reactions 
developed quickly. Later in the normal 
state, the conditioned escape responses were 
vigorous, as were the pupillary reactions. 
Harlow felt that curare is a CNS depressant; 
but in light doses it permits transferable CRs 
to be formed. He (Harlow, 1940) further 
stated that a failure of learning to proceed 
normally under curare was due to the cere- 
bral depressant effects. “Thus the data from 

the curare experiments cannot be used to 
support the motor theory of learning, and 
clearly refute such a theory if it is to be 
applied to all learning phenomena (p. 281). 
This conclusion was not very convincing 


when applied to learned skeletal responses. 
After all, Harlow’s subjects had been able 
to perform escape responses during the cu- 
rarization period. 

Then followed as persevering and careful 
a series of experiments as one can find in 
modern experimental psychology. Over a 
span of years from about 1938 to 1947, Ed- 
ward Girden elucidated the behavioral prop- 
erties of the curare drugs then available 
(curare and erythroidine) ; our knowledge 
of “dissociation” under curare comes from 
his work. Girden (1940) performed a bilat- 
eral surgical extirpation of the cortical audi- 
tory areas of dogs. After recovery from the 
aftereffects of surgery, the semitendinosus 
muscle was exposed. Then the dogs were 
subjected to a conditioning procedure iden- 
tical to that used in Girden’s 1937 experi- 
ment (Girden & Culler, 1937), using an 
auditory CS and shock US. Dogs condi- 
tioned while curarized gave good CRs later 
in the normal state; and dogs conditioned in 
the normal state gave CRs later when cu- 
rarized. This outcome was in direct contrast 
to that of the earlier experiment. Here is 


what Girden (1940) thought the results 
meant: 


It is clear from the present results that 
conditioned responses of the semitendino- 
sus muscle to an auditory stimulus in the 
normal animal are mediated by cortical 
pathways, whereas the CRs developed 
under curare involve subcortical systems. 
When a section of the cortical pathway is 
literally extirpated, the block between the 
normal and curare states is disrupted; the 
CRs established in either of the two con- 
ditions will now be manifested in the other 
state (p. 404). 


Girden voiced the conviction that normal 
animals conditioned under curare are prob- 
ably unconscious. Furthermore, he (Girden, 
1940) entertained the idea that even in his 
dogs with removed auditory cortex the same 
condition might have existed. 


In the partial decorticate, the amnesic con- 
dition is disrupted so that the curare CR 
now appears in the subsequent normal 
period. But the animal still appears to be 
unaware of what had occurred under the 
drug. That is, the animal does not mani- 
fest the diffuse struggle behavior ordi- 
narily present during the first stages of 
conditioning. . . . It seems as if the animal 
is still unaware of what had happened 
under the drug, the semitendinosus CR 
occurring much like the motor automa- 
tisms reported in hysterics (pp. 405-406). 
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If this conclusion were true, then the useful- 
ness of curare, for the investigation of learn- 
ing without skeletal responding, would cer- 
tainly be nil. 

Girden (1942a) used intact dogs rather 
than the excised semitendinosus preparation. 
In addition to the old curare drug, he used 
erythroidine, a variant of curare. Some 
subjects were conditioned to give defensive 
limb withdrawals to a CS while in the nor- 
mal state. They were later tested in the 
drugged state and showed no retention. 
Some subjects were conditioned by CS-US 
pairings while drugged and were later tested 
while normal, These showed no retention 
either. Even when the dosages of curare or 
erythroidine were light, permitting gross 
bodily responses to the US during condition- 
ing in the drugged state, there was no reten- 
tion in the normal state. This dissociation 
phenomenon was likened to amnesia. 

Girden (1942b) then extended his studies 
to the conditioning of visceral reactions. A 
glass canula was inserted in the common 
carotid artery of the dog for purposes of 
blood pressure recording. Using a light 
flash as CS and shock to forepaw as US, 
and paralyzing the dogs with erythroidine, 
Girden showed that a blood pressure increase 
could be readily conditioned, extinguished, 
and reconditioned. However, the retention 
of a blood pressure CR in the normal state 
was not measured. On this omission, Girden 
(1942b) said: “It is safe to infer, however, 
that the blood pressure CR, developed in the 
drug state, was suppressed after recovery 
as were the other components of the drug 
state CR which have been studied” (p. 230). 
This conclusion was based on the fact that 
Girden’s dogs, after recovery from drug 
effects, did not respond to the CS with 
startle, respiration changes, or pulse rate 
changes. 

In view of the fact that Harlow and 
Stagner (1933) had believed that the con- 
ditioned pupillary reflex, established under 
curare, was retained later in the undrugged 
state, the question of dissociation for vis- 
ceral or autonomic CRs was cértainly not 
settled yet. Therefore, Girden (1942c) re- 
turned to the study of the conditioned pupil- 
lary reaction. In dogs immobilized by 
either curare or erythroidine, a conditioned 
pupillary dilation was established, using a 
bell CS and shock to forepaw as US. The 
conditioned reaction was shown to be due 
to sympathetic innervation, and not due to 
inhibition of parasympathetic innervation, 
since this CR failed to develop if sym- 
Pathetic fibers to the pupil were cut. The 
Dupillary CR was readily extinguishable in 


the drug state. But the CRs established in 
the drug state did not transfer to the later 
normal state. So Girden (1942c) stated: 
“We cannot concur, therefore, with the con- 
clusion that ‘curare produces different in- 
fluences upon the various components of a 
total conditioned response pattern'" (p. 
331). He was in complete disagreement 
with Harlow and Stagner (1933) on this 
point. In addition, Girden, in order to main- 
tain that the dissociation phenomenon was 
due to a block between cortical and sub- 
cortical functions, would have had to as- 
sume that pupillary conditioning in the 
normal state was cortically mediated. He 
was, however, silent on this point at the 
time. Later, Girden (1943a) experimented 
with different levels of erythroidine dosage, 
producing mild to very deep paralysis in 
dogs. He found that autonomic CRs were 
established under erythroidine but were not 
detectable later in the normal state, no 
matter how deep the flaccid paralysis of 
striate muscles had been. He further found 
that only those striate muscle reactions ac- 
tually made under erythroidine, can become 
conditioned. When they do, they are part 
of a generalized "excited" emotional reac- 
tion pattern. Girden conjectured that a 
human subject, rendered partially immobile 
by erythroidine, would be unconscious; but 
later on in the normal state, any autonomic 
CRs established in the drug state would 
be rearoused. The subject would perceive 
his own autonomic upset without knowing 
how it had developed. Such autonomic CRs 
would only transfer to the normal state if 
they were artifacts of a general skeletal 
emotional response pattern which had oc- 
curred under light paralysis. Thus Girden 
tried to tie together autonomic and skeletal 
CRs into an emotional reaction pattern. 
The autonomic reactions are side effects of 
the skeletal emotional responses. With deep 
paralysis, therefore, autonomic CRs would 
not later transfer to the normal state. 

In the same year, Girden (1943b) showed 
that the EEG in rhesus monkeys and dogs 
remained normal under flaccid paralysis in- 
duced by erythroidine. This was true even 
after repeated dosages of erythroidine had 
immobilized subjects for as long as 4 hours. 
This finding put in doubt the role of cortical 
depression in the dissociation phenomenon. 
But it did show that artificial respiration, 
required in deeply curarized subjects, was 
adequate for normal brain function in this 
and previous experiments in Girden's labora- 
tory. This was no trivial finding, in view of 
the peculiar amnesias found with curare im- 
mobilization. Finally, Girden (1947) ex- 
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tended his studies to rhesus monkeys. Using 
erythroidine, four subjects were avoidance 
trained under light dosages. Using both 
visual and acoustic CSs, CRs rapidly de- 
veloped under erythroidine. Girden (1947) 
reported that such CRs often developed 
more rapidly in the drug state than in the 
normal state. 


As previously observed in the dog, there 
is a functional dissociation in the curarized 
monkey with the result that drug state 
learning is completely repressed upon com- 
plete recovery from the drug. The CR 
appears spontaneously when the monkey 
is again curarized (p. 587). 


The development of autonomic CRs was 
normal under erythroidine, but there was 
no transfer to the normal state. We wish 
to emphasize the striking finding that dis- 
crete, localized skeletal CRs, as well as 
autonomic CRs, were found to develop faster 
in the drug state than in the normal state. 
Girden had previously noted this in dogs, 
and had commented upon it. In one study, 
using the semitendinosus response, 17 sub- 
jects had reached an arbitrary conditioning 
criterion under curare in 23 to 125 trials. 
In the normal state, another group of sub- 
jects had required 100 to 250 trials to reach 
criterion. The observations on monkeys 
confirmed this finding. 

Morgan, (1951) in tracing the history of 
the use of curare and erythroidine in be- 
havioral experiments, accepted Girden's gen- 
eral conclusions (Morgan, 1951) : “Thus the 
story is now simple and clear. Conditioning 
reactions, whether skeletal or autonomic, 
established under curare or erythroidine, do 
not carry over to the normal state and vice 
versa" (p. 771). Clearly these two drugs 
could not be legitimately used for the orig- 
inal purpose of the Harlow and Stagner 
experiment. If one wished to study the im- 
portance of the peripheral skeletal response 
in learning, other immobilizing techniques 
would be required. ' 

It turned out that Girden stopped his ex- 
perimentation on curare drugs too soon. A 
new curare derivative, d-tubocurarine, had 
been administered to a human subject for 
experimental purposes in 1945. The object 
was to test its efficacy as an anesthetic. For 
several years prior to this time, both curare 

and erythroidine had been used as an anes- 
thetic supplement for surgery. aig onm 
they had been used alone for surgery. They 
were effective muscular relaxants, and as 
were an aid to surgical procedures. 


such 


However, when d-tubocurarine was used 
alone as an anesthetic, sometimes the pa- 
tient would complain after the operation 
that he had experienced severe pain. This 
had not happened with curare or erythroi- 
dine. Therefore, Smith, Brown, Toman, and 
Goodman (1947) used the new paralyzing 
drug, d-tubocurarine, on Smith to see 
whether it had anesthetic or cerebral de- 
pressant properties. Smith was given a dose 
large enough to induce total flaccid paraly- 
sis. He was artificially respirated. The 
other three experimenters presented their 
colleague with a wide variety of external 
stimuli, some painful, while he was immo- 
bile. After the subject recovered he was 
able to relate in accurate detail most of 
what had transpired. He said his “senses” 
were extremely "clear" under d-tubocura- 
rine. His blood pressure remained normal 
during the procedure, and except for the 
discomfort of needing to cough and being 
unable to do so, suffered no ill effects. This 
outcome, the authors stated, agrees with the 
finding of Girden (1948) that curare drugs 
do not depress the EEG as long as arti- 
ficial respiration is adequate. It is not clear 
why Smith et al. (1947) ignore Girden's 
work on dissociation while accepting that 
on the EEG. They state that Girden, em- 
ploying CR techniques with animals, be- 
lieved his results to indicate that curare 
causes unconsciousness or amnesia; but they 
add that his evidence is not very conclusive. 
On the contrary, we would feel that his 
evidence was good, and it agreed with the 
clinical findings on the old curare as an 
anesthetic agent. Clearly, d-tubocurarine 
was quite different from curare and erythroi- 
dine in its psychological action. 

The Smith et al. (1947) findings have 
been partially confirmed by perceptual tests 
with human subjects. McIntyre, Bennett, 
and Hamilton (1951) found that doses of 
d-tubocurarine left CNS processes unaf- 
fected. Unna and Pelikan (1951) found, 
with human subjects (six healthy volun- 
teers) that small, subparalytic doses of d- 
tubocurarine did not alter a wide variety 
of Perceptual functions. They (Unna & 
Pelikan, 1951) stated: “No evidence was 
obtained of any action other than on the 
neuromuscular junction. ... In particular, 
no effects on autonomic organs and also 
none on cerebral functions could be demon- 
strated . . ." (p. 480). Lauer (1951) dem- 
onstrated the transfer of learned forelimb 
flexion in two dogs from the immobile state 
to the normal state, using d-tubocurarine to 
produce complete paralysis. Beck and Doty 
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(1957) have confirmed the same type of 
finding in cats, using a cataleptic compound, 
bulbocapnine. In an extensive monograph 
now in press, Black, Carlson, and Solomon 
(1962) have demonstrated several related 
phenomena of autonomic conditioning in 
dogs paralyzed by curare. They have con- 
firmed the fact that the more recent curare 
drugs (flaxedil, d-tubocurarine) produce a 
complete flaccid skeletal paralysis without 
the dissociative or amnesic effects produced 
by the curare drugs used previously in the 
pioneering work by Harlow and Stagner 
(1933), Girden (1940, 1942a, 1942b), and 
others. The classical conditioning of ANS 
reactions in curarized dogs has been shown 
to transfer to conditioned reflexes in the 
normal, undrugged state. In addition, the 
development of ANS discriminative, condi- 
tioned reactions has been demonstrated in 
curarized dogs. This means that ANS con- 
ditioning is not merely an artifact of skeletal 
responding or its proprioceptive feedback ; 
it means also that the original experimental 
idea. of Harlow and Stagner can now be 
carried out as a satisfactory test of the 
"central process" versus "peripheral process" 
interpretations of learning and transfer. 


METHOD 
Subjects 


Subjects were six healthy mongrel dogs 
obtained from the Harvard Medical School 
animal farm. They were of medium size 
(10-12 kilograms). Their previous histories 
were unknown because they were strays. 


Apparatus 


The apparatus has been described in some 
detail by Black (1958). A Pavlov-type 
harness held the subject in a comfortable, 
suspended posture. Two panels were placed 
on either side of the subject's head. The 
stimulus light (S°) was placed 1 foot above 
and 4 feet in front of that subject's head. 
The light was provided by a 40-watt bulb. 
The tones (S* and S-) were provided by 
two Bud Code-practice oscillators, set at 
approximately 160 cycles per second and 
1200 cycles per second. They were placed 
4 feet in front of, and 2 feet above the dog's 
head. The amplitude was set at a level that 
Seemed nonaversive to the experimenters. 

Appropriate interval timers, relays, and 
Programming equipment were used to con- 
trol the CS-US interval, US duration, and 
intertrial intervals. The accuracy of the 


CS-US interval timer was 10 seconds + 0.05 
second. The accuracy of the shock dura- 
tion timer was 5 seconds + 0.02 second. 

The EKG was recorded on a Grass four- 
channel polygraph. Alligator clip elec- 
trodes, attached to shaved skin areas, were 
used to pick up EKG impulses. The shock 
electrodes were those described by Black 
(1958) ; they were brass plates embedded 
in Bakelite, and they were laced tightly to 
the hind toe pads. 


Procedure 


Stage onc—avoidance training. The sub- 
jects were placed in the harness, EKG and 
shock electrodes were attached, and the 
panels were adjusted so that each dog could 
readily press either panel The experi- 
menters then leít the experimental room. 
Lights were turned off (with the exception 
of the signal light and one dim 5-watt lamp 
which hung above and behind the subject). 
The signal light shone directly on the sub- 
ject’s face. In the door to the experimental 
room was a one-way mirror through which 
the experimenters observed the subjects. 

When the initial struggling period seemed 
over and the subject was calm, training was 
begun. At least three test trials were given 
at the outset, in order to be sure that there 
was no initial panel-pressing response to the 
S? (light out). Then the S? was paired 
with shock, with a CS-US interval of 10 
seconds. Shock stayed on until the subject 
pressed either panel, at which moment so 
and shock were simultaneously terminated. 
The intertrial intervals were either 1, 14, 
or 2 minutes, presented in random order. 
Stimulus presentation, trial presentation, re- 
cording of responses, as well as control of 
the S? and shock onset, were automated so 
that the subject could be left unattended. 
If the subject pressed a panel in less than 
10 seconds following CS onset, then S? was 
instantaneously removed and no shock was 
presented. The subjects were trained until 
they gave reliable avoidance responding. 
Pressing either panel prevented shock. A 
criterion of at least 20 consecutive avoid- 
ance responses was required, but each sub- 
ject was trained to a different criterion. 
The differing avoidance criteria were used 
in order to vary the number of trials since 
the last shock was received prior to curare- 
conditioning. Then the subjects were tested 
for sensitization by presentations of the two 
stimuli which were to be used in the next 
stage of the experiment. A 160 cycles per 
second tone and a 1200 cycles per second 


210 Ricuamgp L. SOLOMON AND LUCILLE H. Turner 


tone were presented three times each, in a 
random order. ) 

Stage  two—conditioning under curare. 
After the subject met the avoidance cri- 
terion, he was curarized. D-tubocurarine 
was administered ie, with a dose of at 
least 0.7 cubic centimeter per kilogram, or 
2.0 milligrams per kilogram, needed to en- 
sure total flaccid paralysis. A gradual in- 
jection procedure was used, starting with 
an initial dose of 6.0 milligrams, and therc- 
after injecting 0.3 milligram per minute. 
When respiration stopped, usually 30 sec- 
onds after an initial 6.0 milligrams dose, an 
endotracheal tube was inserted, and arti- 
ficial respiration was begun. 

Conditioning was started when the subject 
showed no respiratory movements, no cor- 
neal lid reflex, no ear or tongue twitches, 
and no paw movements. Conditioning trials 
were carried out in the Pavlovian manner. 
A delayed conditioning procedure was used 
with a 10-second CS-US interval and a 
shock of 5 seconds duration and of 4 milli- 
amperes intensity. Thus the CS lasted 15 
seconds, and the US occurred during the 
last 5 seconds of this interval. The CS was 
terminated at the moment of shock termina- 
tion. Two stimuli were used: S*, a tone 
paired with shock, and S^, a tone never 
paired with shock. The 160 Cycles per 
second tone was S* for dogs number 21, 22, 
23, and 25. The 1200 cycles per second tone 
was S* for dogs number 32 and 33. 

All six subjects were given 99 curare- 
conditioning trials. The following sequence, 
PT hak pia ahs was 
repeated five times and then was followed 
by a sequence of 19 alternated + and — 
trials, ending on the 99th trial with a+ 
trial. 

Stage three—transfer tests. After the 
curare-conditioning session, all subjects were 
given at least 48 hours to recover from the 
side effects of curarization. Then they 
were tested in the normal state in the same 
apparatus as that used previously for both 
avoidance training and curare-conditioning. 
All three CSs were presented, and the be- 
havior of the six subjects in response to 
these stimuli was recorded. No shocks were 
given, so the test sessions were extinction 

ions. . 

The order of test presentations of the 
three types of CSs presented serious prob- 
lems of potential biasing. While there vr 
no shocks presented on these test ar 
there was the possibility that. order e me 
could either enhance or diminish is i 
pressing responses to the three CSs. For 


example, if the subject were at first pre- 
sented with a series of S- presentations 
and did not respond, there was the 
possibility that responses to the S* or S? 
might become less probable due to some 
iype of generalization of inhibition. Or, 
perhaps, if a long series of S9 trials were 
presented, this might facilitate responding 
to S- and S* due to some type of general- 
ization of excitation or a sensitization (or 
pseudoconditioning) phenomenon. We were 
not certain how to solve this problem with- 
out a very large number of subjects. There- 
fore, we explored different orders of CS 
presentation with different subjects. The 
purpose was to eliminate biasing for cach 
subject, in order to determine whether 
the three CSs had differential elicitation 
strengths. 

Latencies of the panel-pressing responses 
Were measured from CS onset to the time 
the panel was depressed 1 inch. The CS 
was arbitrarily terminated if the subject 
did not press the panel in 15 seconds (the 
time interval initially required for the 10- 
second CS-US interval plus the 5-second 
shock duration period). If no response was 
made in 15 Seconds, it was recorded as an 
infinite latency. It was known from previ- 
ous studies that the termination of the CS 
after a short time period facilitates extinc- 
tion. Therefore, we were using conservative 
conditions for the demonstration of response 
differentiation. 


RrsurTs 


In Table 1 are listed the total num- 
ber of shocks received by each sub- 
ject during avoidance training and the 
number of consecutive avoidance re- 


TABLE 1 


Suoeks RECEIVED, AND REsPoNsEs ro REACH 
CRITERION, FOR ALL SUBJECTS 


Subject's Number Consecutive 


Name of avoidance 

number | sei | ARE | Rupe 
21 Harry 46 25 
22 Bugeye 43 58 
23 Buff 88 20 
25 Nipper 52 20 
32 Punc 119 77 
33 Mahog 267 88 
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sponses made in reaching the arbitrary 
avoidance learning criterion. 

The latencies of panel-pressing are 
the more important observations of this 
experiment; these are plotted in Fig- 
ures 1-6 for the last 20 training trials 
prior to curare-conditioning and for 
the test trials after recovery from cu- 
rare effects. The data on sensitization 
tests are not plotted since all these 
pretest responses to S* and S- were 
of infinite latency. In Figures 1-6 re- 
sponses to S? are indexed by filled 
circles; responses to S* are indexed 
by closed triangles, and responses to 
S- are indexed by open triangles. On 
the ordinate is the reciprocal of latency 
(X 100) on a logarithmic scale. The 
arbitrary infinite latency (no response 
in 15 seconds) is at the bottom of the 
ordinate scale. 


It is convenient for us to describe 
our results by individual subjects. All 
subjects lead us to the same general 
conclusions but show slightly differing 
phenomena. 

Subject #21, shown in Figure 1, on 
the last 20 trials before curare-condi- 
tioning, showed normally stable avoid- 
ance responding to the original training 
stimulus, S^. When tested on S* and 
S- before the  curare-conditioning 
trials, he gave no responses. Like 
most subjects, he pricked up his ears, 
looked around, but made no incipient 
panel-presses. After curare-condition- 
ing, we can see that there still was no 
response made to the S7. On the other 
hand, a féw responses appeared to S5 
and these were of short latency. The 
latency of responses to S? were of the 
same order of magnitude as were those 
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LAST 20 TRIALS TEST TRIALS— EXTINCTION AFTER CURARE CONDITIONING 


BEFORE CURARE 
Frc. 1. Transfer effects in Dog #21. 


Latencies of panel-pressing responses to the original 


training stimulus, S°, and to the two stimuli, S* and S5, used in the curare-conditioning pro- 
cedure. (Note that the latencies are expressed in reciprocal form. Note that this dog shows 


Very little differentiation of S* and S7.) 
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Fic. 2. "Transfer effects in Dog #22. Latencies of panel-pressing responses to the original 
training stimulus, S°, and to the two stimuli, S* and S~ used in the curare-conditioning pro- 


cedure. (Note that the latencies are expressed in reciprocal form. 


Note that this dog shows 


very little differentiation of S* and S- on early test trials, but there is clear evidence of more 
rapid extinction of panel-pressing to S~ than to S*) 


prior to curare-conditioning. One re- 
sponse was made to S- in 35 tests. 
Ten responses were made to S* in 42 
tests. We felt that the presentation 
of 10 S- tests at the outset inhibited 
the responding to S* on later tests. 
We were especially struck by the fact 
that 30 subsequent test trials on S° 
led to a facilitation of panel-pressing to 
S*. Following these 30 S? trials, there 
were 7 responses in 18 S* trials and 1 
response in 15 S- trials. The results 
on this animal were promising enough 
to lead us to replicate the experiment 
on other subjects. 

Subject £22, shown in Figure 2, 
achieved stable avoidance responding 
prior to curare-conditioning. When 
tested with S- and S* prior to curare- 
conditioning, he showed no panel- 
presses, indicating no sensitization ef- 
fect at that stage of the experiment. 
Following curare-conditioning, how- 
ever, this subject pressed the panel in 
response to all three CSs. But after 


approximately 40 test trials, the la- 
tencies of avoidance responding to S- 
were clearly more variable than those 
to St. After approximately 80 test 
trials, the response latencies to S- were 
significantly longer than those to S*, 
and there were several failures to re- 
spond to S~. If we look at Test Trials 
110-160, there can be no doubt that 
Subject #22 was responding strongly 
to St but very poorly or not at all to 
S. There were nine failures to re- 
spond to S- on Test Trials 110-137. 
On Test Trials 138-163, there were 
no failures to respond to S*. Interest- 
ingly enough, when finally tested on 
S°, on which Subject #22 had tei d 
nally been trained, there were 13 fail- 
ures to respond in 17 tests. — 3 

Subject £23, shown in wed m. 
gave stable avoidance respon p o 
S? prior to curare-conditioning. When 
tested for sensitization on S" and S 
prior to curare-conditioning, he gave 


"ne -COn- 
no responses. Following curare-con 
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Fic. 3. Transfer effects in Dog #23. Latencies of panel-pressing responses to the 
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original 


training stimulus, S°, and to the two stimuli, S* and S~ used in the curare-conditioning pro- 


cedure. (Note that the latencies are expressed in reciprocal form. 


Note that this dog 


shows excellent differentiation of S* and S- at the beginning of transfer tests, and that 


responses to S* extinguish rapidly.) 
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ditioning Subject #23 was first tested 
on S- and failed to respond in 10 tests, 
Then, when S* was next presented, he 
responded on all 10 tests, but the 
latencies were longer than those to 
S? had been prior to curare-condition- 
ing. After three failures to respond 
again to S-, there were three long- 
latency responses to S* and then there 
were 10 failures to respond. Extinc- 
tion on S* was not, however, as com- 
plete as on S-. When finally tested 
on the original S°, Subject #23 showed 
stable responding, similar to that 
shown prior to curare-conditioning. 
Such tests on S? seemed to facilitate 
subsequent responding to S* but not 
to S". There were no responses to S 
during the whole course of testing. 
However, when S° was tested, subse- 
quent tests on S* always yielded a few 
responses followed by esHastión. 
Finally, the response to S° extin- 
guished to some extent after S? had 
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of panel-pressing responses to the original 
used in the curare-conditioning pro- 
rocal form. Note that there is good 


are given. Note the long latencies of 


been presented 45 times over Test 
Trials 47-99. On Test Trials 108-122 
there were 11 failures to respond to S°. 
Thus, even when the panel-pressing 
avoidance response was in the process 
of extinguishing, the differentiation be- 
tween S- and S* was striking. 

Subject #25, shown in Figure 4, like 
the other subjects, achieved stable 
avoidance responding prior to curare- 
conditioning. He showed no sensitiza- 
tion phenomenon prior to curare-con- 
ditioning when pretested on S- and S*. 
Subsequent to curare-conditioning, 
however, there was good evidence of 
differentiation between S- and S*. We 
started testing with S- and obtained im- 
mediate panel-pressing. The latencies 
were variable, however, and many of 
them were longer than those previously 
seen in response to S? prior to the 
curare-conditioning session. In con- 
trast, responding to S* was stable and 
of the same order of magnitude as that 
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obtained on S° prior to curare-condi- 
tioning. There were no failures to 
respond quickly to S*. But there were, 
between Test Trials 1-65, 23 failures 
to respond in 35 presentations of S-. 

We then embarked on a long series 
of tests, extending over three experi- 
mental days, during which we tried to 
extinguish the panel-pressing response. 
We alternated long sequences of S* and 
S°, testing infrequently on S^, hoping 
to weaken the panel-pressing response. 
In Figure 4 we show the results of 
this testing after 420 trials. The re- 
sponse to S? was clearly weaker than 
it had been. The response to S- was 
Still absent. In contrast, the response 
to S* was only slightly weaker than 
it had been about 50 trials earlier. 
Here, then, was a case where curare- 
conditioning, even though the panel- 
pressing response could not be per- 
formed during it, produced a stronger 
avoidance response to S* than that to 


S°, even though the panel-press re- 
sponses were made originally to S? 
and not to S*! 

Subject #32 is shown in Figure 5. 
He, like the other subjects, achieved 
stable avoidance response latencies prior 
to curare-conditioning. When tested for 
sensitization with S- and S*, he gave 
no panel-pressing responses. After 
curare-conditioning, Subject #32 was 
first tested with S-, and he made no 
panel-pressing responses. Then, when 
tested with S* he again made no re- 
sponses. Finally, when tested with 
the original training stimulus, S°, he 
responded with characteristic short la- 
tencies. Following three quick re- 
sponses to S°, he responded with 
longer latencies to S*. Then there 
were two responses to S^, followed by 
a failure to respond. From Test Trials 
40-76, there were no responses to 16 
presentations of S^; there were 21 re- 
sponses out of 22 tests on S'; and 
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Fic. 6. Transfer effects in Dog #33. Latencies of panel-pressing responses to the original 
training stimulus, S°, and to the two stimuli, St and S^, used in the curare-conditioning pro- 
cedure. (Note that latencies are expressed in reciprocal form. Note the excellent differenti- 
ation of S* and S- at the outset of testing. Note the maintenance of this differentiation over 


150 test trials.) 
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there were six responses out of nine 
tests on S°. With repeated testing on 
all three CSs, there was no apparent 
extinction trend for S? or S+, while the 
latencies of response to S- were either 
long or infinite. Over Test Trials 158— 
246, the subject gave 26 short-latency 
responses to 26 S? presentations; he 
gave 30 short-latency responses to 30 
S* presentations; and in contrast, he 
gave 19 responses in 33 presentations 
of S~, and these were of long latency. 
While there was no significant differ- 
ence between response latencies to S? 
and S+, the latencies to S- differed sig- 
nificantly from those of S* and $°. 
Here, again, there was clear evidence 
of differentiation in avoidance respond- 
ing to the two discriminative stimuli 
used in curare-conditioning. 

In Figure 6 are shown data for Sub- 


ject #33. His avoidance responses 
prior to curare-conditioning were 
stable and of short latency. When 


tested for sensitization by presenta- 
tions of S- and St, he gave no panel- 
presses. After the curare-condition- 
ing session, he was first tested on S-. 
He failed to respond to three such 
presentations. Then, when tested with 
10 successive presentations of S*, he 
responded 10 times; the first responses 
were sluggish, but they increased in 
vigor, and the final tests of this se- 
quence yielded latencies of the same 
magnitude as those characteristic of re- 
sponding to S? prior to curare-condi- 
tioning. Then three more S- presenta- 
tions elicited no responses. Three S? 
presentations then elicited short-la- 
tency panel-presses. Over the course 
of 173 test stimulus presentations, the 
responses to S* showed little change ; 
the responses to S? also remained 
stable. In contrast, the responses to 
S- were practically absent; there were 
only 2 panel-presses in a total of 49 
S- presentations, and these were slug 
gish. Here again, there was clear evi- 


dence of differentiation between S* 
and S-. 
DISCUSSION 


The results show, in all six subjects, 
that the panel-pressing responses to S* 
were more vigorous, more reliable, less 
variable, and of shorter latency than 
those to S-. Responses to S* were 
also more highly resistent to extinc- 
tion effects than were those to S-. On 
the other hand, the comparison of re- 
sponses to S° and S* does not yield a 
clear conclusion. In some subjects, S* 
was more resistant to extinction effects 
than was S°, and in other subjects, the 
reverse was true. We feel that the 
number of shocks received in original 
avoidance training with S° may be 
a factor in producing this variability of 
outcome. Dogs #32 and #33 re- 
ceived the most shocks in training, and 
were carried out to a strict avoidance 
learning criterion, 77 and 88 trials, re- 
spectively. These two subjects had 
the best differentiation between S- and 
S* in the tests. In addition, the test- 
ing procedure following curare-condi- 
tioning may be important. Presenting 
S° or S*, followed by other CSs, seems 
to elevate the general responding level 
to all CSs. Presenting S- followed by 
the other CSs, seems to depress the 
general responding level to all CSs. 
And finally, the 1200-cycle tone seemed 
slightly better as an S* than was the 
160-cycle tone. The number of sub- 
jects in our experiment is not great 
enough to prove these points, nor is 
the design of our experiment adequate 
for proof. However, the indications 
are promising enough to pursue fur- 
ther with appropriate experiments. 

The theoretical implications of this 
experiment are clear. Animal subjects 
can acquire discriminative instrumen- 
tal response tendencies without the 
overt responses themselves being ex- 
ercised in the presence of the discrim- 
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inative stimuli. The differential 
strengths of panel-pressing tendencies 
in the presence of S~ and S* could only 
have been acquired during the dis- 
criminative  curare-conditioning ses- 
sion. The pretests for sensitization 
showed no generalization from S° 
either to S- or to S* prior to curare- 
conditioning. Following curare-condi- 
tioning (during which S* was paired 
with shock and S- was presented with- 
out shock pairings), tests on all six 
subjects showed stronger panel-press- 
ing tendencies to St than to S~. In 
Some cases responses to S^ were prac- 
tically absent. Since no panel-pressing 
could occur during curare-condition- 
ing, there could be no temporal pair- 
ing either of St and Ry, or S- and 
Ross. The differential, discriminative 
panel-pressing which revealed itself 
later must have been due to the Pav- 
lovian procedure of classical, discrimi- 
native conditioning. Thus, one might 
ask, what were the mediators of this 
Special type of transfer of training or 
thinking? What conditions could have 
led our dogs to reveal, through their 
panel-presses, what they had acquired 
during the conditioning procedure? 
We feel that a systematic exploration 
of these questions could lead to a 
better knowledge of transfer and 
thought. By exclusion, we can defi- 
nitely maintain that differential periph- 
eral skeletal responses (and their asso- 
ciated feedback) could not have been 
a factor in mediating the discrimina- 
tion learning which we have clearly 
demonstrated. Thus, those theories 
of transfer of training and of thinking 
which restrict themselves to peripheral 
skeletal response mediators, and their 
correlated feedback from the periph- 
ery, cannot explain the outcome of our 
experiment. 

One possible mediation process 
Could reside in ANS conditioning and 
its correlated feedback from the ANS 
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periphery. There was some evidence, 
in a few of our subjects, that there was 
a greater cardiac response to S* than 
to S- at the end of the curare-condi- 
tioning session. (A detailed account 
of discriminative cardiac conditioning 
in curarized dogs will appear later in 
another paper.) Possibly the differ- 
ential strengths of ANS CRs elicited 
by S* and S~ could serve to mediate 
the discriminative panel-pressing be- 
havior through correlated visceral sen- 
sory feedback processes. lf so, we do 
not know the essential conditions for 
such mediation. For example, what 
outcome would be obtained if the sub- 
jects were given discriminative curare- 
conditioning first, then were trained to 
panel-press to S?, and then were tested 
on S- and S* in the normal state? 
What would have been the outcome 
if the CS-US interval used in avoid- 
ance training to S? were shorter than, 
or longer than, that used in the curare- 
conditioning session when S* was 
paired with shock? These questions, 
which definitely suggest a long series 
of parametric studies, could, if an- 
swered, tell us a great deal about con- 
ditions for transfer of training and for 
thinking. For, despite the clear results 
of this experiment, we still don't have 
a good explanation for the very “in- 
telligent" behavior of our dogs. What 
led them to panel-press to S*? They 
could have merely howled, struggled in 
the harness, or have become “frozen” 
if S* had frightened them, and they 
could have done less of this when S- 
was presented. Why did they respond 
appropriately ? 

Our subjects clearly "put together" 
two experiences in such a way that 
two stimuli, never before associated 
directly with lever-pressing, elicited 
differential lever-pressing during the 
transfer tests. The accomplishment of 
this transfer phenomenon did not re- 
quire differential skeletal responding 
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during the differentiation phase. Of 
course there may be kinds of transfer 
effects, or types of thinking, for which 
peripheral skeletal responding during 
differentiation is necessary or impor- 
tant. The type of phenomenon we 
have studied is clearly not one of these, 


SUMMARY 


Dogs were trained to avoid shock 
in response to a signal. The avoidance 
response was the pressing of a panel, 
the signal (S°) was a light going out, 
the shock was of 4-milliamperes in- 
tensity (applied to the hind toe pads), 
and the time interval between signal 
onset and shock onset was 10 seconds. 
If the correct response occurred dur- 
ing the time interval between signal 
onset and the usual shock presentation 
time, no shock was given and the sig- 
nal was terminated. Intertrial inter- 
vals were varied systematically, with a 
mean of 1.5 minutes. 

After the dogs were reliably pressing 
the panel in Tesponse to the signal, 
with response latencies of 3 seconds or 
shorter, they were totally paralyzed by 
curarization. While the dogs were 
thus completely immobilized under 
curare, a Pavlovian discriminative con- 
ditioning session was carried out. A 
new signal (S*) was consistently 
paired with shock, using a delayed 
conditioning procedure and a time in- 
terval of 10 seconds between S* onset 
and shock onset. The shock duration 
was 5 seconds. On some trials a con- 
trasting signal (S-) was presented for 
15 seconds, but it was not paired with 
shock. A sequence of 99 discrimina- 
tive conditioning trials was presented, 

i i * trial. The S* and 
ending with an S* trial. e St 
S- trials were partly randomized in a 

special sequence. After this condi- 
tioning session, the dogs were given 
48 hours in which to recover from the 
various physiological side effects of 
curarization. Next they were returned 
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to the training situation in the normal, 
undrugged state, and the three previ- 
ously used stimuli (S°, S*, and SP) 
were presented. The latency of panel- 
pressing responses was recorded dur- 
ing these tests of the efficacy of the 
three stimuli. 

The dogs responded in a way con- 
sistent with their discriminative Pav- 
lovian conditioning experience under 
curare. There were frequent panel- 
presses in response to S° and S*, very 
few in response to S-. When a dog 
pressed a panel in the presence of S7, 
the latency of the response was often 
long. 

The technique used in this experi- 
ment is a special case of training 
wherein the trainee can do nothing 
overtly about the information or stimu- 
lus sequences as they are presented. 
It therefore gives the experimenter 
very precise stimulus control, because 
the usual overt responses of the sub- 
ject are not present to modify the stim- 
ulus situation in unpredictable ways. 

This experiment demonstrates. that 
certain types of transfer of training or 
problem solving can occur without the 
benefit of mediation by peripheral skel- 
etal responses or their associated feed- 
back mechanisms. Whether or not 
peripheral ANS reaction mechanisms 
play a role in such transfer phenomena 
remains to be seen, and such a possibil- 
ity should be seriously explored in 
future experiments, 
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THE CREATIVE PROCESS? 
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University of Michigan 


The intent of this paper is the pres- 
entation of an associative interpreta- 
tion of the process of creative thinking. 
The explanation is not directed to any 
specific field of application such as art 
or science but attempts to delineate 
processes that underlie all creative 
thought. 

The discussion will take the follow- 
ing form. (a) First, we will define 
creative thinking in associative terms 
and indicate three ways in which crea- 
tive solutions may be achieved—seren- 
dipity, similarity, and mediation. (b) 
This definition will allow us to deduce 
those individual difference variables 
which will facilitate creative perform- 
ance. (c) Consideration of the defi- 
nition of the creative process has sug- 
gested an operational statement of the 
definition in the form of a test. The 
test will be briefly described along with 
some preliminary research results. 
(d) The paper will conclude with a 
discussion of predictions regarding the 
influence of certain experimentally 
manipulable variables upon the crea- 
tive process. 

Creative individuals and the proc- 
esses by which they manifest their 
creativity have excited a good deal of 


1'The essence of this paper was written 
while the writer was a Visiting Research 
Psychologist at the Institute of Personality 
Assessment and Research, University of 
California, Berkeley. The author wishes to 
acknowledge his indebtedness to Benton Ms 
Underwood for his encouragement and - 
stimulation of his paper, nm, ^ 
research on thinking" (1952). W " on = 
material has been supported by the cima s. 
tive Research nouo of Be xs 
education ( Projec o. 
o Sience Foundation (Grant G-3855). 
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interest and curiosity. There are ex- 
tended analyses of novels and novel- 
ists, poems and poets, mathematics and 
mathematicians, both biographical and 
autobiographical. Perusal of the intro- 
spections of manifestly creative in- 
dividuals uncovers a surprising vein of 
similarity in the processes they de- 
scribe (Ghiselin, 1952). Thus we 
find Albert Einstein’s self-searching to 
suggest that “The psychical entities 
which seem to serve as elements in 
thought are certain signs and more or 
less clear images which can be com- 
bined . . . This combinatory play 
seems to be the essential feature in 
productive thought.” Samuel Taylor 
Coleridge is described as having de- 
veloped his ideas in the following man- 
ner : “Facts which sank at intervals out 
of conscious recollection drew together 
beneath the surface through the almost 
chemical affinities of common ele- 
ments.” In the field of art, we find 
André Bréton referring to a collage 
by Ernst as being distinguished by a 
“marvelous capacity to grasp two mu- 
tually distant realities without going 
beyond the field of our experience and 
to draw a spark from the juxtaposi- 
tion.” Most explicit, however, is the 
oft-quoted statement by the mathema- 
tician, Poincaré, who talks about an 
evening when “ideas rose in crowds; I 
felt them collide until pairs interlocked 
so to speak, making a stable combina- 
tion. By next morning I had estab- 
lished the existence of a class of Fuch- 
sian functions.” From these experi- 
ences, Poincaré felt that he could state 
that “to create consists of making new 
combinations of associative elements 
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© which are useful. The mathematical 
facts worthy of being studied . . . are 
those which reveal to us unsuspected 
kinships between other facts well 
known but wrongly believed to be 
strangers to one another. Among 
chosen combinations the most fertile 
will often be those formed of elements 
drawn from domains which are far 
apart.” An exceptionally compelling 
illustration of a useful combination of 
elements “drawn from domains which 
are far apart" occurs in a line from the 
poem, “The Monkey Puzzle" by Mari- 
anne Moore (1951), “The lion's fero- 
cious chrysanthemum head.” 

We will state our basic hypothesis 
regarding the nature of creative think- 
ing in the form of a definition. With 
these introspective statements serving 
as background, we may proceed to de- 
fine the creative thinking process as the 
forming of associative elements into 
new combinations which either meet 
specified requirements or are in some 
way useful. The more mutually re- 
mote the elements of the new combina- 
tion, the more creative the process or 
solution. An additional criterion of 
the level of creativeness of a product 
is described below. 

Creative thinking as defined here is 
distinguished from original thinking 
by the imposition of requirements on 
originality. Thus, 7,363,474 is quite 
an original answer to the problem 
“How much is 12 +12?” However, 
it is only when conditions are such that 
this answer is useful that we can also 
call it creative. There are many orig- 
inal ideas expressed in institutions for 
the mentally ill and mentally retarded ; 
few of these are likely to be creative. 
There are many fields of creative en- 
deavor in which the usefulness of prod- 
ucts would be difficult to measure re- 
liably, While these difficulties must 
sventually be faced, for the present our 
*Search efforts have been concentrated 
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on laboratory situations in which cri- 
teria for usefulness can be arbitrarily 
experimenter-defined and unequivo- 
cally explained to the subject. The 
originality of a response is simply in- 
versely related to its probability in a 
given population. 

It should be pointed out that this 
definition of creativity is quite similar 
to basic notions advanced by British 
associationists from Locke (1690) to 
Bain (1855), and by those psycholo- 
gists whose work is based in large 
measure on their speculations. Freud 
(1938), Hollingsworth (1928). and 
Binet (1899) may serve as examples. 


Ways OF ACHIEVING A CREATIVE 
SoLuTION 


In terms of associative theory, we 
may point to three ways of achieving 
a creative solution. Generally, any 
condition or state of the organism 
which will tend to bring the requisite 
associative elements into ideational 
contiguity will increase the probability 
and speed of a creative solution. 
Therefore, the following three ways of 
attaining creative solutions are all 
methods of bringing the requisite asso- 
ciative elements together. 

Serendipity. The requisite associa- 
tive elements may be evoked contigu- 
ously by the contiguous environmental 
appearance (usually an accidental con- 
tiguity) of stimuli which elicit these 
associative elements. This sort of cre- 
ative solution is often dubbed seren- 
dipitous. This is the manner of dis- 
covery to which is popularly attributed 
such inventions as the X ray and such 
discoveries as penicillin. One physi- 
cist has described how he has reduced 
serendipity to a method by placing in 
a fishbowl large numbers of slips of 
paper, each inscribed with a physical 
fact. He regularly devotes some time 
to randomly drawing pairs of these 
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facts from the fishbowl, looking for 
new and useful combinations. His 
procedure represents the operational 
embodiment of this method of achiev- 
ing creative solutions. 

Similarity. The requisite associative 
elements may be evoked in contiguity 
as a result of the similarity of the 
associative elements or the similarity 
of the stimuli eliciting these associative 
elements. This mode of creative solu- 
tion may be encountered in creative 
writing which exploits homonymity, 
rhyme, and similarities in the structure 
and rhythm of words or similarities in 
the objects which they designate. The 
contiguous ideational occurrence of 
such items as alliterative and rhyming 
associates may be dependent on a fac- 
tor such as primary stimulus general- 
ization. It seems possible that this 
means of bringing about contiguity of 
associational elements may be of con- 
siderable importance in those domains 
of creative effort which are less di- 
rectly dependent on the manipulation 
of symbols. Here we might include 
certain approaches to painting, sculp- 
ture, musical composition, and poetry. 

Mediation. The requisite associa- 
tive elements may be evoked in con- 
tiguity through the mediation of com- 
mon elements. This means of bringing 
the associative elements into contigu- 
ity with each other is of great impor- 
tance in those areas of endeavor where 
the use of symbols (verbal, mathemati- 
cal, chemical, etc. . . .) is mandatory. 
For example, in psychology, the idea 
of relating reactive inhibition and cor- 
tical satiation may have been mediated 
by the common associates "tiredness" 
or "fatigue" (Kohler & Fishback, 


1950). 
INDIVIDUAL DIFFERENCES 


From the definition given above, the 
factors that will make for individual 
differences in the probability of achiev- 
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ing creative solutions may be deduced. 
Any ability or tendency which serves 
to bring otherwise mutually remote 
ideas into contiguity will facilitate a 
creative solution; any ability or ten- 
dency which serves to keep remote 
ideas from contiguous evocation will 
inhibit the creative solution. 

Listed below are several illustrative 
predictions concerning individual dif- 
ferences that one may make from 
this theoretical orientation. 


Need for Associative Elements 


It should be clear that an individual 
without the requisite elements in his 
response repertoire will not be able 
to combine them so as to arrive at a 
creative solution. An architect who 
does not know of the existence of a 
new material can hardly be expected 
to use it creatively. 


Associative Hierarchy 


The organization of an individual’s 
associations will influence the proba- 
bility and speed of attainment of a 
creative solution. There is a whole 
family of predictions that one may 
draw from this concept of the associa- 
tive hierarchy. As an initial example, 
let us take the question of the manner 
in which the associative strength 
around ideas is distributed. If we 
present an individual with the word 
"table," what sort of associative re- 
sponses does he make? The individ- 
ual who tends to be restricted to the 
stereotyped responses, such as “chair,” 
may be characterized as having an as- 
sociative hierarchy with a steep slope 
(see Figure 1). That is, when you get 
past the first one or two conventional 
responses to the stimulus, the individ- 
uals associative strengths to other 
words or ideas (lower in the hier- 
archy) drops rapidly. We can also 
conceive of a second sort of individual 
whose associative hierarchy is charac- 
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terized by a rather flat slope. This is 
an individual who perhaps also has as 
his strongest response the conventional 
chair. But for him this response is not 
overly dominant and so it is more 
likely that he will be able to get to the 
less probable, more remote kinds of 
associations to table. It is among these 
more remote responses that the requi- 
site elements and mediating terms for 
a creative solution will be lurking. 
This slope factor may be related to 
the mathematical analysis of associa- 
tive production developed by Bous- 
field, Sedgewick, and Cohen (1954). 
It probably is closely approximated 
by their constant, m, measuring rate of 
depletion of the associative reservoir. 
They found a high negative correlation 
between rate of association and total 
number of associations. It would be 
Predicted from Figure 1 that the high 
Creative subject (flat hierarchy) would 
Tespond relatively slowly and steadily 
and emit many responses while the 
low creative subject (steep hierarchy) 
Would respond at a higher rate but 
emit fewer responses. 

It would be predicted that the 
greater the concentration of associative 
Strength in a small number of stereo- 
typed associative responses (steep 
hierarchy) the less probable it is that 
the individual will attain the creative 
solution. Thus, the word association 
behavior of the high creative individual 
should be characterized by less stereo- 
typy and commonality. This last pre- 
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diction is supported by a study by 
Mednick, Gough, and Woodworth 
(Mednick, 1958). Research scientists 
rated for creativity were divided into 
relatively high (N — 15) and rela- 
tively low (NV = 15) groups. The low 
creatives gave more stereotyped re- 
sponses on 80% of a group of 36 test 
words from the Kent-Rosanoff list. 
(These test words were chosen for 
their tendency to elicit stereotyped re- 
sponses. Stereotypy was defined by 
the Minnesota Kent-Rosanoff Word 
Association Norms, Russel & Jen- 
kins, 1954). It should be pointed out 
that these results lend themselves to 
another possible interpretation. The 
highly creative individual may also 
have a steep hierarchy but a deviant 
one. That is, his most dominant asso- 
ciative response may be quite strong 
but quite different from the popular, 
dominant associative response. There 
are different predictions that can be 
made for the flat-associative-hierarchy 
creative and the steep-deviant associa- 
tive-hierarchy creative. The latter is 
more likely to be the one-shot pro- 
ducer (a not uncommon phenomenon 
among novelists). If he does create 
further products, they will tend to re- 
semble closely the first product. Thee 
former is more likely to be a multi- 
producer; he is more likely to produce 
in a variety of avenues of creative ex- 
pression. 

The prediction suggesting an expec- 
tation of less creativity from an in- 
dividual with a high concentration of 
associative strength in a few responses 
leads to another prediction. The 
greater the number of instances in 
which an individual has solved prob- 
lems with given materials in a certain 
manner, the less is the likelihood of 
his attaining a creative solution using 
these materials. Such an individual 
will *know the meaning" of the ele- 
ments of the subject matter. That is, he 
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will have a steep associative hierarchy 
around these elements. An example 
of the operation of this principle re- 
cently occurred to the writer while 
teaching an honors freshman introduc- 
tory course in psychology at the Uni- 
versity of Michigan. I was giving a 
well known interpretation of a well 
known experiment in stimulus general- 
ization when interrupted by a student 
who calmly stated that the interpreta- 
tion was in error. After a few minutes 
of blustering I asked him to explain. 
His explanation proved him to be cor- 
rect. I had been dealing with this 
material for years and "knew" the 
"correct" interpretation; for him this 
material was new, he had a low, flat 
associative hierarchy. Thus, if a new- 
comer to a field has the requisite in- 
formation, he is more likely to achieve 
a creative solution than a long-time 
worker in the field. This may be the 
reason that theoretical physicists and 
master chess players are often said to 
have passed their prime by the age of 
25. 


Number of Associations 


The greater the number of associa- 
tions that an individual has to the re- 
quisite elements of a problem, the 
greater the probability of his reaching 
a creative solution. This variable is 
not independent of the preceding one 
since an individual with a high con- 
centration of associative strength in 
few associative responses is not likely 
to have a proliferation of associations. 
The more associates which are evoked 
by a requisite element of a problem, 
the more likely it is that an associate 
will exist which will serve as a medi- 
ating bridge to another requisite ele- 
ment, facilitating combination. It 
seems likely that this variable will not 
be related to speed of creative solution 
since it may take a good deal of time 

to get to the mediating links. 
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Cognitive or Personality Styles 


Previously learned or innately pre- 
disposed methods of approaching prob- 
lems will influence the probability of 
a creative solution. If the requisite 
associational elements of a new and 
useful combination are probable asso- 
ciates of the concrete representations 
of relevant aspects of the problem, an 
individual with a predominately “per- 
ceptual” approach will be more likely 
to reach a creative solution. If, how- 
ever, the requisite associational ele- 
ments are not elicited as responses to 
these concrete representations or if 
there is no concrete representation then 
an individual with a “conceptual” ap- 
proach will be more likely to reach a 
creative solution. 

Another cognitive style of impor- 
tance may lie along the "visualizer- 
verbalizer" dimension. The visualizer 
is one who tends to call up relatively 
complete memorial sensory representa- 
tions of the relevant concrete aspects 
of problems. If the problem deals with 
horses, he tends to picture a horse in 
terms of its sensory qualities. On the 
other hand, the verbalizer explores the 
problem by associating with words 
around the word "horse." If the req- 
uisite elements are high in his verbal 
associative hierarchy to the word 
horse, the verbalizer will be more likely 
to attain a creative solution; the visual- 
izer may be thrown off or at least de- 
layed by many false leads. On the 
other hand, if a requisite verbal asso- 
Ciative response to the word horse is 
very low, or not present in the ver- 
balizer's hierarchy, then the visualizer 
will be more likely to attain the crea- 
tive solution. It is therefore clear that 
some types of problems will be solved 
more easily by the visualizer and some 
by the verbalizer. 

Factors such as these (admittedly 
very poorly defined) may be partly re- 
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sponsible for differential aptitudes for 
creative work in differing fields. 


Selection of the Creative Combination 


. The creative combination of elements 
is only one among the many which may 
present themselves to the subject. How 
or why is the creative combination se- 
lected? Some speculations regarding 
this problem follow. The explanation 
of the process of selection may be re- 
lated to the nature of the problem. 
Problems either entail a specific and 
relatively objective set of testable cri- 
teria (Paint a realistic portrait of this 
individual. Design a refrigerator so 
that it will be automatically free of 
frost.) or they do not (The chemist 
mixes two liquids out of curiosity. The 
painter dabs hopefully at a fresh canvas 
waiting for an idea. The psychologist 
tosses a new test into a correlation 
matrix). When specific criteria are 
provided, they form an important part 
of the stimulus set which is determining 
which associative elements are being 
elicited and thus becoming eligible for 
entering into combination with other 
elements, Important sets of associa- 
tions to each of these combinations are 
the consequents of the combinations. 
The Set of consequents for each com- 
bination (1f I put x, y, and z together, 
cere sé wil happen) is continually 
d pin ed the set of requirements 
auem or em. When the set of conse- 
& new combination achieves 

a close fit with the set of problem re- 
quirements, this combination is selected. 
When there is complete overlap of sets, 
search behavior" is terminated. As 
with the other requisite elements of the 
problem, individual differences in this 
g will vary with (among other 
- ings) the structure of the associa- 
Onal hierarchies around the require- 
rig of the problem. When the 
Tigerator-defroster problem was pre- 
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sented to an undergraduate class almost 
all of the proferred solutions were based 
on the principle of ridding the refriger- 
ator of already heavily accumulated 
frost. A couple of individuals (possi- 
bly familiar with the defroster principles 
presently in use) suggested methods 
which disposed of the frost before it 
built up to an overly annoying level. 
In addition to these there were two 
unique responses, i.e., a “new” method 
of preventing moisture from condens- 
ing in the freezer compartment, and a 
method of allowing frost to accumulate 
but limiting the location of accumula- 
tion to a small box which could be regu- 
larly and conveniently removed and 
emptied. Thus it may be seen that an 
individual's associations to the require- 
ments may be characterized as to their 
stereotypy; the imposed requirements 
of the problem may be viewed as part 
of the requisite elements in the situa- 
tion. The earlier theoretical statements 
concerning these elements may be seen 
as being relevant here. The foregoing 
suggests an explanation of the selection 
process for the case where the subject 
must hunt for a combination of ele- 
ments which will satisfy given criteria. 
In the case where no criteria are speci- 
fied, the subject is typically producing | 
random combinations of elements; the 
task of selection in this case consists in 
finding relevant criteria for the given 
partial products. 

If we may continue along a bit fur- 
ther with this example of the defroster, 
we may begin to see some glimmerings 
of a solution to the most serious prob- 
lem in research on creative thinking— 
how may we determine to what degree 
behavior is creative? We have sug- 
gested one criterion in our hypothesis. 
In the following an additional criterion 
is developed. To begin with let us ex- 
amine the requirements as originally 
stated— "Design a refrigerator so that 
it is automatically free of frost." The 
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first thing that strikes us is that while 
Some requirements have been stated, 
there are even more that are strongly 
implied and essential, many that are 
desirable, and a number that we would 
only become aware of after some 
method of satisfying them had been 
suggested. 

Let us examine some possible so- 
lutions : 

l. Simply refraining from opening 
the refrigerator door would solve the 
problem as stated since this would pre- 
vent moisture from entering and con- 
densing as frost. This solution meets 
many of the implied requirements. It 
is cheap, convenient, effective, does 
not require special training, etc... . 
However, it is not an optimal solution 
since it violates one essential, implied 
requirement—the usefulness of the 
refrigerator must not be impaired. 

(This is the cutting-off-your-nose-to- 
spite-your-face solution.) 

2. A primitive solution is the 
hammer-and-screwdriver method. This 
is tried and true and meets many of the 
essential requirements, Tt falls down 
in that it is inconvenient, messy, un- 
economical (when caked with frost, the 
refrigerator unit is very inefficient), en- 

ngers the mechanism, and is hardly 

1tomatic, 

3. In a refrigerator we once owned 
another solution was used. The open- 
ing and closing of the refrigerator door 
operated a counter. Ata certain count 
the refrigerator unit was automatically 
heated and the melted water evaporated 
outside the refrigerator. The superi- 
ority of this solution is immediately ap- 

parent. The source of this superior- 
ity lies in the number of requirements 
which it meets. It is economical, auto- 
matic, convenient, peculiarly appropri- 
ate (the operation of the heating 
element is contingent upon the number 
of door openings. The amount of frost 
accumulated is also in part dependent 
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on the number of door openings.), does 
not interfere with the normal use of the 
refrigerator, and does not require spe- 
cial training. Note that the principle 
behind this highly creative solution 
(not allowing massive build-ups of 
frost) was infrequently suggested in 
the classroom group. However, this 
solution is not wholly successful at 
meeting some criteria. The frequent 
heating and cooling may injure frozen 
food stored near the heating element, 
Secondly, since the heating process 
must be brief and mild, it is inevitable 
that not all frost is removed. While 
this solution does effectively curtail the 
number of defrostings, it does not elim- 
inate them completely, It is clear that 
a method which would encompass all of 
the advantages of the “counter” method, 
but which would, in addition, eliminate 
defrosting altogether would be even 
more creative. What js Suggested by 
this discussion is that the creativeness 
of a product is Some function of the 


number of requirements that the prod- 
uct meets, 


research in which tasks, solutions, and 


A TEST or Creativity 


The definition of the creative process 
has Suggested a way of testing for indi- 
vidual differences in creativity. The 
test items are intended to require the 
testee to perform creatively. That is, 
he 1$ asked to form associative elements 
into new combinations by providing 
mediating connective links. Since the^ 
lest situation is contrived, the combina- 
tion must meet specified criteria that 
are experimenter imposed. 

The definition dictates the structure 
of the test. We must provide stimulus 
items from two mutually distant reali- 
ties and ask the subject to “draw a 
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spark from their juxtaposition.” To 
state it more usefully, we must provide 
stimulus elements from mutually re- 
mote associative clusters and have the 
subject find a criteria-meeting medi- 
ating link which combines them. A 
first problem concerns the type of ma- 
terial of which the stimulus item should 
be composed. If the test is to be ap- 
propriate for all fields of creative en- 
deavor, the material must either be 
nonsensical so as to avoid bias favoring 
any specific means of creative expres- 
sion, or it must be so common in so- 
ciety that familiarity could be assumed 
to be high across fields of interest. The 
problems involved in constructing the 
nonsense materials so as to avoid favor- 
ing any interest groups soon proved to 
be apparently insurmountable. This 
left us searching for materials with 
which most individuals in the culture 
could claim acquaintance; this, in turn, 
brought us to verbal materials. 

While it may be true that certain 
occupational groups have extensive ex- 
perience in dealing with words, there 
are some verbal associative habits that 
could reasonably be assumed to be fa- 
miliar to almost all individuals that 
have been brought up in this (USA) 
culture. Among such habits are the 
associative bonds between words like 
“ham and eggs,” “bed-bug,” “pool-hall,” 
“hound-dog,” “whole-wheat,” “chorus- 
girl,” “kill-joy,” and “red-hot.” These 
became the materials for the test. 

Having decided on the materials, the 
test almost constructed itself in accord- 
ance with the definition. Several words 
from mutually distant associative clus- 
ters must be presented to the subject; 
his task must be to provide mediating 
links between them. Further, (a factor 
of extreme importance), the mediating 
link must be strictly associative rather 
than being of a sort that follows elabo- 
rate rules of logic, concept formation, 
or problem solving. In their final (or 


227 


at least present) form, the test items 
consist of sets of three words drawn 
from mutally remote associative clus- 
ter. One example might be: 


blue 


The subject is required to find a fourth 
word which could serve as a specific 
kind of associative connective link be- 
tween these disparate words. The an- 
swer to Example 1 is “cheese.” 
“Cheese” is a word which is present in 
the word pairs “rat-cheese,” "blue- 
cheese,” and “cottage-cheese.” The 
subject is presented with several ex- 
amples so that he has an adequate op- 
portunity to achieve the specific set 
necessary for the task. 


Example 1: rat cottage 


Example 2: railroad girl class 
Example 3: surprise line birthday 
Example 4: wheel electric high 
Example 5: out dog cat? 


(None of these examples is a test item 
from any form of the actual test.) The 
two college level forms of the test (one 
coauthored by Sharon Halpern and the 
other by Martha T. Mednick) have 30 
items each; the subject is allowed 40 
minutes; his score is the number right. 

The test, called the Remote Associ- 
ates Test (RAT), has some interesting 
correlations with other measures. 

Comparisons with Criteria. A stud: j 
was conducted at the College of Archi- 
tecture, University of California, Berke- 
ley, by the writer and Sharon Halpern. 
Ratings of creativity by faculty mem- 
bers of the College who taught the De- 
sign courses were correlated with RAT 
scores. These ratings form an unusu- 
ally excellent criterion of creative per- 
formance since the raters had been 
advising and evaluating the students in 
the creation of new designs and models 
of structures. They had been working 
with these students for at least a year 

2Answers to sample RAT items: 2. 
working; 3. party; 4. chair or wire; 
house. 
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and in many cases two or more. The 
ratings and RAT scores correlated sig- 
nificantly (r = .70, df = 19, p < .01) 
In this study an early form of the RAT 
was used. 

The RAT was administered to a 
group of first year psychology graduate 
students at the University of Michigan 
whose native language was American 
English (N = 35). Faculty research 
supervisors (who had been directing 
the independent research efforts of the 
students), rated the eight highest and 
eight lowest RAT scorers either “high 
or “low” in research creativity (no 
middle category allowed). Research 
creativity was defined as being demon- 
strated if the student developed new 
research methods and/or pulled to- 
gether disparate theory or research 
areas in useful and original ways. Of 
the 16 research supervisors, one felt 
that he had not had enough contact 
with his student to make the judgment. 
His student was a low RAT scorer. 
Of the eight high RAT scorers, six 
were rated high on research creativity 
and two were rated low; of the seven 
low RAT scorers, only one was rated 
high, the other six being rated low. 
By Fisher's exact test the probability 
_ of these events occurring by chance is 
less than .05. Miller Analogies Test 
(MAT) scores were available for these 
students. Of the seven high MAT 
scorers, three were rated high on re- 
search creativity; of the eight low 
MAT scorers, four were rated high in 
research creativity. 

Reliability. The Spearman-Brown 
reliability of the RAT was .92 in one 
sample (289 women, almost all the 
students at an Eastern women s college, 
tested as part of a project under the di- 
rection of Theodore Newcomb) and 
91 in another (215 men tested at the 
University of Michigan as part of a 


project under the direction of Warren 


T. Norman). 
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Correlation with Grades. One of the 
present forms of the RAT correlated 
negatively with the first-two-year grade 
point averages of a group of under- 
graduates at a large Eastern technology 
college. (r= —.27, N = 74, p < 05). 
This same correlation was obtained 
with the summer grades of a smaller 
group, N = 34, of summer students at 
a large Eastern liberal arts college (not 
statistically significant in this case). 
In a study by Miller (1960) it was 
found that high RAT scorers tended to 
get higher grades from teachers rated 
as flexible than from teachers rated as 
dogmatic. Low RAT scorers received 
higher grades from teachers rated dog- 
matic than from teachers rated as 
flexible. 

Correlation with Social Attitudes 
and Occupational Interests. Tt is clear 
that creative individuals must have 
access to improbable associative re- 
sponses. Kowalski (1960) hypothe- 
sized that this is a general tendency 
which also manifests itself in their atti- 
tudes and interests. She tested and 
interviewed 15 high RAT scoring and 
15 low RAT scoring undergraduate 
women. The two groups had radically 
differing views on sexual morality and 
women’s rights. The views of the high 
creatives were more atypical and 
“liberal” (U —37, p«.001). On 
the Strong Vocational Interest Blank, 
Mens’ Form (SVIB), the high creative 
group showed "significantly higher in- 
terest on the artist (p < .05), psycholo- 
gist (P < .005), physician (p < .025), 
mathematician (p < .025), and author- 
journalist (p < .05) keys. The low 
creative group showed higher interest 
on the farmer (p < .05), math-physical 
science high school teacher (p < .05), 
office man (p < .05), and pharmacist 
(p < 01) keys" (p. 19). (These are 
the probability values of obtained chi 
squares.) The only one of these keys 
related to ACE scores was that of 


— 


Tue ASSOCIATIVE Basis OF THE CREATIVE PROCESS 229 


physician. “The commonality of these 
interest patterns was evaluated by 
noting the per cent overlap of the 
specific key with the general popula- 
tion of interest expression. For ex- 
ample, the Farmer key overlaps 45% 
with the general population, while the 
Artist key overlaps 20%” (p. 20). 
The significant keys of the higher 
creatives had significantly less com- 
monality than the significant keys 
of the low creatives. These differ- 
ences were independent of the influence 
of intelligence as measured by the ACE. 

Associative Behavior. In the discus- 
sion of illustrative predictions it was 
suggested that highly creative individu- 
als would be characterized by a flat as- 
sociative hierarchy rather than a steep 
associative hierarchy. Further, it was 
proposed that the greater the number 
of associations that an individual has to 
the requisite elements of a problem, the 
greater the probability of his reaching 
a creative solution. From these two 
independent statements it may be de- 
duced that when required to display his 
reservoir of associations to single stim- 
ulus words, the highly creative indi- 
vidual will have greater access to less 
probable associates and therefore pro- 
duce a greater number of associates. A 
study by Craig and Manis (1960 un- 
published *) supports this deduction. 
Thirty-eight college students had the 
RAT and an associative task admin- 
istered to them. In the associative task 
they were given 1 minute to write as 
Many associates as they could to each 
of 20 words. The correlation of the 
number of such associates with RAT 
Scores was .38 (p < .01). 

In two related studies, Karp (1960) 
and Kowalski (1960) found RAT 
Scores to be directly related to the or- 


" Craig, M., & Manis, M. Prediction of 

Hose on the Remote Associates Test by 

Pos of response repertoire. Unpublished 
Nuscript, 1960. 


iginality and quantity of anagrams con- 
structed using the test word "Genera- 
tion." In the Karp study 40 under- 
graduates were given 5 minutes to pro- 
duce as many four letter anagrams from 
the test word as they could. The pro- 
ductions were scored for quantity 
(number of acceptable answers) and 
originality (a weighted score for each 
response was developed from the fre- 
quency with which the response was 
given by the 40 subjects). The correla- 
tion of the RAT with the quantity 
scores was .44 (p < .01) ; the correla- 
tion of the RAT with the originality 
score was .37 (p< .05). Kowalski 
presented the anagrams task to 15 high 
RAT scorers and to 15 low RAT 
scorers, giving them 5 minutes to pro- 
duce words of any length from the test 
word *Generation." In this study orig- 
inality was measured by computing the 
percentage of responses given by an in- 
dividual which had not been given by 
any other of the 30 subjects. The dif- 
ference on this measure between high 
and low RAT scorers was significant 
(U =68, p < .04). "Only four sub- 
jects in the low creative group gave 
any original responses at all while 
eleven subjects in the high creative 
group did" (p. 19). 

At the Institute of Personality As- 
sessment and Research, University of 
California, Berkeley, the RAT was in- 
cluded as part of the assessment battery 
administered to a group of 40 highly 
eminent architects. The RAT corre- 
lated .31 with the Originality (O-I) 
Scale of the IPAR Questionnaire Scale 
and —.31 with the total Conformity 
Score obtained in the  Crutchfield 
Conformity Experiment (Crutchfield, 
1955). Interviewers rated high scorers 
as significantly higher in “graceful and 
well-coordinated in movement” and 
“reticent and taciturn in speech.” The 
college grade point average which the 
subjects reported correlated —.34 with 
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RAT score, a result which tends to 
confirm findings reported above. 


EXPERIMENTALLY MANIPULABLE 
VARIABLES 


While only one experimental study 
(described below) which makes use of 
this general framework has been com- 
pleted in this laboratory, it may be use- 
ful to indicate briefly the kinds of ex- 
perimental investigation it suggests. 

Massed vs. Distributed Work Ses- 
sions. Total time of work being equal, 
massed sessions of creative work should 
be more successful than distributed ses- 
sions. There are two reasons why this 
would be so. The first is that the indi- 
vidual making use of the massed session 
technique is more likely to achieve tem- 
poral contiguity of the requisite associ- 
ative elements within a single intensive 
work period than is an individual who 
has distributed his work in shorter 
periods over several days. Secondly, it 
may take some time for an individual 
to work on a problem enough to go 
beyond its obvious aspects. In the first 
hour of work, he may get through only 
the conventional and stereotyped asso- 
ciations to the elements of the problem, 
while it is perhaps in the later stages of 
intensive work on a problem that one 
can begin to entertain the more remote 
associations that are evoked by ele- 
ments of the problem. It is, of course, 
among these remote associations that 
the key to the creative solution will lie. 

Warmup. In creative work a warmup 
session should serve to arouse the more 
remote associations to the requisite ele- 
ments of the problem. While their 
work has gone considerably beyond the 
problem of warmup, Maltzman, Bo- 
gartz, and Breger (1958) have demon- 
strated that the repeated elicitation of 
different word associations to the same 
stimulus words does indeed tend to 
roduce remote associations to bs 
stimulus words. Further, this induce 
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originality tends to transfer to other 
relatively unrelated tasks presented 
after this associative warmup. Associ- 
ative warmup of this type should be- 
come more effective as the warmup 
stimuli are more similar to the task ma- 
terials. It may be that the effects of 
warmup will prove to be a further ad- 
vantage that massed sessions have over 
distributed sessions for creative pro- 
ductivity. 

Stereotyping Associalive Responses. 
As stated above, if an individual’s as- 
sociative response to a stimulus element 
of a creative problem is of excessive 
strength, this will tend to reduce the 
likelihood of occurrence of more remote 
associative responses. This will reduce 
the probability and speed of creative so- 
lution. It would therefore be predicted 
that extensive training of a specific re- 
sponse other than a requisite one to a 
stimulus element of a problem requiring 
a creative solution should retard later 
attempts at solution of the problem. 
This prediction is related to the con- 
cept of “functional fixedness" intro- 
duced by Duncker (1945). Birch and 
Rabinowitz (1951) and Adamson and 
Taylor (1954) completed experiments 
which are related to this prediction. 
Their test situation was the two string 
problem. The subjects are asked to 
tie together two strings suspended from 
the ceiling. When the subject grasps 
one string he finds that the other string 
is hanging out of his reach, The solu- 
tion to the problem requires the sub- 
Ject to attach a weight to one of the 
strings, get the weight swinging and 
catch it while holding the other string. 
Various objects can be used as a 
weight. The subjects that had been 
pretrained by having them use a switch 
in its usual manner of functioning 
tended to be unlikely to use it as a 
weight. They had developed strong re- 
sponse strength for the association 
"switch-close circuit" which had re- 
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duced the probability of the remote as- 
sociation “switch-weight.” 

Another feasible experimental ap- 
proach would make use of the RAT 
item as a creative task and test the in- 
fluence upon. it of certain variables. 
For example, the words of which an 
item is composed may be presented at 
varying rates to test the massed trials 
hypothesis. In addition, various pre- 
training conditions may be evaluated in 
terms of their effectiveness in increasing 
or decreasing RAT performance. 

Another possible experimental ap- 
proach would entail separating out high 
and low RAT scorers and observing 
the differential effect of certain vari- 
ables upon their behavior. In an ex- 
periment just completed Houston and 
Mednick (in press) postulated that an 
important motive impelling the be- 
havior of the creative individual was a 
need for improbable associative stimu- 
lation, It was reasoned that if such 
stimulation were supplied, it would 
tend to satisfy this need and be rein- 
forcing. Further, if such stimulation 
regularly followed a given response the 
high creative individual should tend to 
learn that response. Thirty high and 
30 low RAT scorers were asked to 
read aloud only one of two typed 
words on a 3X5 card. Excepting 
buffer items and including 40 pairs 
aimed at gauging the free operant 
level of noun-choice, there were 160 
pairs of words, each pair consisting of a 
noun and a nonnoun (verb, adjective, 
adverb, etc.). Ifa subject in the experi- 
mental group (15 high RAT subjects, 
15 low RAT subjects) responded with 
the noun member of a pair, the experi- 
menter responded with an improbable 
association; if the subject chose the 
Nonnoun, the experimenter responded 
TS the most probable association. In 
HE control group (15 high RAT, 15 
EA RAT) both the nouns and the 

nouns were invariably followed by 


their most probable associate. Asso- 
ciative probabilities were obtained from 
the Russell and Jenkins (1954) and 
Deese (1960) norms. If the improba- 
ble response was satisfying a need, 
the probability of noun-choice should 
increase over the 160 trials. It did 
significantly in the high RAT experi- 
mental group; the low RAT experi- 
mental group showed a decrease. The 
high and low RAT control groups 
showed no reliable change. 

Some of the positions which have 
been taken in this paper are assump- 
tions and not deductions. As more 
data are gathered some of these as- 
sumptions will assume the status of 
facts, some will be revised. For ex- 
ample, the opening paragraph suggests 
that the paper is not meant to apply 
only to one field of creative endeavor 
but attempts to delineate processes 
that underlie all creative thought. This 
may require modification. The expla- 
nation may fit the process of scientific 
discovery and not be appropriate to 
discussions of painting or music. For 
the present (paradoxically enough), the 
more encompassing assumptions seem 
more parsimonious. It may eventually 
turn out (as is hinted at in the body of 
the paper) that the differences between 
the fields are more determined by dif- 
ferences in suitability of the three 
means of achieving contiguity, ie. 
serendipity, similarity, and mediation. 


SUMMARY 


An associative theory of creative 
thinking has been outlined. Differences 
between high creatives and low crea- 
tives have been predicted along speci- 
fied dimensions. Predictions have been 
made regarding the effect on the crea- 
tive process of some experimentally 
manipulable variables. 

The associative definition of the crea- 
tive process has taken the operational 
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form of a test. Some preliminary re- 
search with this test is described. 
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Letter combinations such as nonsense 
syllables and trigrams (three letter 
combinations that may or may not form 
words or syllables) have a number of 
characteristics found to influence rate of 
learning. Those characteristics that 
have been investigated most thoroughly 
may all be referred to as frequency vari- 
ables. "These variables are: 

1. Frequency, per se: A number of 
possible measures of frequency exist. 
Some trigrams form words. For these 
trigrams, Thorndike-Lorge (1944) fre- 
quencies can be obtained. Some tri- 
grams and nonsense syllables form 
sounds or syllables that occur in the 
English language, and it is possible to 
obtain some measure of the frequency 
of this occurrence. The mav-Lav dif- 
ferentiation of nonsense syllables ac- 
cording to whether or not they occur as 
the first three letters of English words 
is essentially a frequency measure. It 
has proven useful in learning experi- 
ments in which nonsense syllables are 
used as stimuli (e.g, Lindley, 1960) 
and might also serve as a trigram fre- 
quency measure. Finally, the frequency 
of a trigram or nonsense syllable might 
be defined as the frequency that the 
three letters of any given stimulus oc- 
cur in contiguity in English words. 

2. Association value (a): The a of 
a stimulus is generally established by 
determining the proportion of subjects 
who have an association (even if they 
cannot tell what this association is) to 
a stimulus within a rather short (2-4- 
Second) time interval. Association 
Value does not directly measure the 
Number of associations evoked by a 


stimulus, although syllables that evoke 
an association in a high proportion of 
subjects generally evoke more associa- 
tions, if presented for a longer time 
interval, than do syllables for which few 
subjects have any associations. 

3. Meaningfulness (m): Noble 
(1952) has proposed as a measure of 
m the mean number of associations 
(excluding clang and certain other cate- 
gories of association) that a verbal 
stimulus can evoke from subjects ina 
fixed time interval. While a is estab- 
lished by determining whether a stimu- 
lus can evoke an association, 7 is 
arrived at from the mean number of 
these associations. 

All of these frequency variables in- 
fluence rate of verbal learning. All of 
these variables are substantially related 
to one another. The frequency of oc- 
currence of nonsense syllables as three 
letter sequences in words is related to 
the a of these syllables (Underwood, 
1959), and the Thorndike-Lorge fre- 
quency of words is closely related to m 
(Johnson, Frincke, & Martin, unpub- 
lished ! ; Underwood, 1959). Associa- 
tion value (a) and m are also related 
(Noble, Stockwell, & Pryor, 1957). 
While these three measures are related, 
it seems dangerous to assume that be- 
cause of this relation they are synony- 
mous. Aside from nonequivalent oper- 
ations used in obtaining these measures, 
we have ample precedent in studies of 
the relation of m and familiarity 


1 Johnson, R. C., Frincke, G., & Martin, 
Lea. Meaningfulness, frequency, and affec- 
tive character of words as related to response 
measures. Unpublished manuscript, 1960. 
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(Noble, 1953) and of m and a (Noble, 
Stockwell, & Pryor, 1957) for sus- 
pecting it to be in error to lump these 
measures together as equivalent or 
synonymous. 

The most ambitious recent study of 
verbal learning is probably that of Un- 
derwood and Schulz (1960) as de- 
scribed in their book, Meaningfulness 
and Verbal Learning. A number of the 
experiments reported by Underwood 
and Schulz involve the learning of lists 
of trigrams by various groups of sub- 
jects. For Underwood and Schulz, a 
trigram consists of any three successive 
letters in a word. For example, the 
word "learning" contains the following 
trigrams : LEA, EAR, ARN, RNI, NIN, ING. 
Trigrams were obtained from a large 
sample of words. Trigram frequency 
measures were obtained, with frequency 
being defined as the number of times that 
the various three letter combinations 
occurred in contiguity in the sample of 
words from which they were obtained. 
The term meaning or M is used inter- 
changeably to denote meaningfulness 
(m) and association value (a). While 
frequency and M are discussed sep- 
arately by Underwood and Schulz, the 
assumption that they appear to make is 
that the relation between these two (or 
three) variables is so close that separate 
analyses of the data regarding these 
variables are unnecessary. Pronuncia- 
bility ratings were obtained from sam- 
ples of subjects for those trigrams used 
in a number of learning experiments. 

The trigram experiments center 
around the problem of determining 
whether frequency-M or pronunciabil- 
ity can account for the majority of 
variance in rate of verbal learning. In 
the first of Underwood and Schulz's 
trigram learning experiments (Experi- 
ment 6, p. 144ff.), four lists of eight 
trigrams each were learned by M ee 
samples of subjects. Taking the lists 


as a whole, frequency was significantly 
related to rate of learning, but pro- 
nunciability showed an even closer re- 
lation. Three lists of 12 trigrams each 
served as stimuli in Experiment 11 
(p. 169ff.). No significant relation 
was obtained between trigram fre- 
quency and rate of learning. The re- 
lation between trigram pronunciability 
and rate of learning was highly signifi- 
cant. Underwood and Schulz say, 
"Clearly, we have no alternative but to 
conclude that for these lists the relation 
between frequency and learning is es- 
sentially zero" (p. 171). Other experi- 
ments yielded similar results and Un- 
derwood and Schulz state that M is 
unrelated to rate of learning. They say : 


Rather, we believe that the apparent casual 
status of M derives from a certain amount of 
covariation with pronunciability, and if this 
covariation is removed, only the relationship 
between pronunciability and learning will 
hold up (p. 192). 


(This removal of covariance does in- 
deed reduce the relation of frequency 
(and M) to approximately zero—e.g., 
see p. 194. The question, in partial 
correlation, is what variable to partial 
out.) They again refer to pronuncia- 
bility, saying : 

It appears then, quite unexpectedly and quite 
without theoretical direction, that we have 
stumbled upon an attribute of verbal units 
which has more predictive power by far than 


any other attribute we have discovered or 
methodically set about to measure. (p. 197) 


Further investigations, using gener- 
ated trigrams, confirmed the belief that 
of the different variables studied, pro- 
nunciability was the most closely re- 
lated to rate of learning. Underwood 
and Schulz's rejection of frequency, 
meaningfulness, and association value 
as major variables in verbal learning is 
indeed astounding since this rejection 
runs counter to the previous interpreta- 
tions of a most extensive body of data. 


' 
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It is, therefore, something of a disap- 
pointment to find that Underwood and 
Schulz once more accept frequency as 
the key variable in verbal learning on 
p. 289ff. They say that if one defines 
trigram frequency as the frequency that 
the three-letter combination occurs as a 
sound in the English language, rather 
than using frequency of contiguity as a 
measure of frequency, then frequency 
may serve as an adequate prediction of 
rate of verbal learning. 

As pointed out in this paper, there 
are a number of possible measures of 
trigram frequency. One does not have 
to be too gestalt oriented to have con- 
sidered the possibility that the fre- 
quency with which trigrams appear as 
units of English speech is a better meas- 
ure of frequency, in terms of predictive 
power, than is frequency of contiguity 
of three-letter sequences. While Un- 
derwood and Schulz, in their analysis, 
define frequency in terms of contiguity, 
this writer will define frequency as the 
frequency of occurrence of trigrams as 
discrete or relatively discrete sound 
units in spoken English. The purpose 
of this paper is to reanalyze some of 
Underwood and Schulz’s data from this 
point of view. Experiments 6 and 11 
seem representative of their trigram 
learning experiments and shall form 
the basis of the major portion of this 
reanalysis. 


EXPERIMENT 1 


One syllable words form relatively 
discrete sound units. Sounded syllables 
in words form somewhat less discrete 
units, while three letter combinations 
that are neither words nor sounded syl- 
lables are not sound units of English 
speech, It is relatively simple to select 
the three letter words from trigram lists 
used in Experiments 6 and 11. The 
One exception to this simplicity in se- 
lection is the trigram "our," used in 


Experiment 6, which does not appear in 
Webster's New World Dictionary of the 
elmerican Language, but does appear 
among the 30,000 most frequent words 
in the Thorndike-Lorge tables. The 
writer, decided, on the basis of its rela- 
tively high frequency, to count QUE as 
a word. The next problem was to sep- 
arate syllables from nonsyllables. The 
purpose of the first experiment re- 
ported in this paper was to separate 
those trigrams that occur as syllables 
(sound units that do not, in themselves, 
form words) from those that do not 
occur as sound units, and to determine 
whether there are differences in the 
rate that words, syllables, and nonsyl- 
lables are learned. 


Method 


Subjects. The subjects were 51 upper di- 
vision psychology students. 

Materials. The materials were those 52 
trigrams from Underwood and Schulz’s Ex- 
periments 6 and 11 that do not form English 
words. 

Procedure. The subjects were presented 
with the trigrams in a double column on a 
single dittoed page in the same order that 
these trigrams appeared in Underwood and 
Schulz's book. They were told: 


Look at the first of these three letter com- 
binations. Say it to yourself. Does this 
combination occur as a sounded syllable in 
English words? If it does, put a check 
after it. Do this for all three letter com- 
binations on the page. 


Results and Discussion 


Whether due to perversity or to mis- 
understanding, such improbable com- 
binations as cQu were described as 
syllables by several subjects. For this 
reason it was decided to count three- 
letter combinations as syllables appear- 
ing in the English language only if 
rated as being syllables by at least five 
subjects. The words, syllables, and 
nonsyllables from Experiments 6 and 
11 are as follows: 
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EXPERIMENT 6 


Words—was, BUT, MAN, WHO, ART, 
STY, VIZ, QUE 
Syllables—zon, cvm, STI, JOK, EIG, SOU, 
VER, YAL, CHA, ING 
Nonsyllables—rcu, XPE, IFO, TLY, DGM, XPE, 
GHT, ULD, CKB, MPT, NDF, XPO, 
MBK, DFL 


EXPERIMENT 11 


Words—Boy, URN, HER, HAT, CUT, ITS, 

PAR, LED 
Syllables—rLo, vir, ING, EST, WHE, SOU, 
FRO, UND, VER, CHI, ENT, JUM, 

ROP, ISH 
Nonsyllables—xpk, OUS, WHA, ATI, CQU, DGM, 
YIN, ALI, OMP, MPA, YLV, ABL, 

NCE, TIO 


Kruskal and Wallis H tests show 
that the words, syllables, and nonsyl- 
lables used in Experiment 6 differ sig- 
nificantly in pronunciability (H = 21.7, 
df =2, p<.001) and in mean cor. 
rect anticipations (H = 15.3, df=2, 
p < 001). The words, syllables, and 
nonsyllables in Experiment 11 differ 
significantly in pronunciability (H = 
242, df= 2, p< 001) and in mean 
correct anticipations (H = 132, df= 
2, p < .005). 

If we divide the words, syllables, and 
nonsyllables of each list into these three 
categories and divide the trigrams in 
each category into those above and be- 
low the median in Pronunciability for 
their group, we obtain the d 
sented in Figure 1, 

It is apparent that Underwood and 
Schulz, in obtaining pronunciability 
ratings of these trigrams, ranged words, 
syllables, and nonsyllables along a con- 
tinuum, while the experimenter, using 
another technique, divided them into 
three discrete groups. The significance 
figures show that the two measures are 
substantially related. The method used 

in differentiating between these tri- 
grams would depend on the definition 
and the measure of trigram frequency. 
It seems clear, from this graphic rep- 


ata pre- 
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Fic. 1l. Mean percentage of correct an- 
ticipations for nonsyllables, syllables, and 


words above and below median in pro- 
nunciability. 


Syliabies Vords 


resentation as well as from the H tests, 
that trigrams that form words are 
learned more rapidly than trigrams that 
form syllables (a somewhat less dis- 
crete unit of speech)—contrary to Un- 
derwood and Schulz’s belief as stated 
in p. 284.—and that trigrams that form 
syllables are learned more rapidly than 
are trigrams judged by subjects as not 
forming syllables in the English lan- 
guage. When the data are analyzed in 
this way, pronunciability no longer ap- 
pears as a major variable, 

Whether frequency 
ity is the predictor of r 
a result of the definit 
used. If frequency i 
number of contiguous Occurrences of 
three-letter combinations, then pro- 
nunciability seems to be the significant 
variable. Jf frequency is defined as 


or pronunciabil- 
ate of learning is 
ion of frequency 
s defined as the 


ariable while pronuncia- 
y Serves as an indicator as 


to whether a trigram is a word, syllable, 
or nonsyllable, i 


EXPERIMENT 2 
Underwood and 


Schulz, on p. 280, 
State that the freque 


ncy (of contiguity) 
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hypothesis broke down most completely 
in multiple letter units in the broad 
range between the extremes of well in- 
tegrated and poorly integrated units. 
Most of those trigrams rated by the 
subjects as appearing as syllables in the 
English language fall into this middle 
range of integration. For this reason, 
the experimenter investigated further 
the characteristics of those trigrams 
used in Experiments 6 and 11 that were 
rated as being syllables. The experi- 
menter, using the same subjects that 
had rated the trigrams of Experiments 
6 and 11 and using the same procedure, 
also determined which of the trigrams 
used in Experiment 15 (p. 245ff.) 
formed syllables. Five or more sub- 
jects rated each of the following tri- 
grams used in Experiment 15 as occur- 
ring as syllables in the English lan- 
guage: ING, CED, JAD, ITE, SUL, NUW, 
MEL, DOK, NOP, SOM. 


Method 


Subjects. The subjects were 15 introduc- 
tory psychology students. 

Materials. The trigrams from Experi- 
ments 6, 11, and 15 that formed syllables but 
did not form words were placed on dittoed 
sheets, in random order, 10 trigrams to a 
page. 

Procedure. The sheets were passed out to 
the subjects, face down. The subjects were 
cautioned to leave the sheets face down till 
they were told to turn them over. The ex- 
perimenter then told them: 


What I want you to do is help me in a 
learning experiment. You have a list of 
three-letter combinations that form sylla- 
bles. I am going to give you 20 seconds 
for each syllable. In that 20 seconds I 
want you to say the syllable to yourself 
in as many ways as you wish. Then I 
want you to write down, following the 
syllable, all the words you can think of 
that contain the sound of the syllable. For 
example, if the first syllable was L-o-c, 
then you would say this three-letter com- 
bination to yourself and then write down 
the words that contain L-O-G, as it is 
Sounded, for example, log, logic psy- 
chology, logey, and logger. [This example 
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was put on the blackboard.] You are to 
do this for each syllable. When I say 
"start" turn your paper over and begin with 
the first syllable. When I say "stop" stop 
writing. When I say "start" again, begin 
with the second syllable. Go through the 
entire list in this way. Do not look ahead. 


The subjects signified that they understood 
the directions and were administered the list. 
They were timed by stop watch. Their re- 
sponses were scored. 


Result and Discussion 


The mean number of words pro- 
duced for each of the syllables is pre- 
sented in Table 1. 

The rank-order correlation between 
the number of words produced for each 
syllable and the mean number of cor- 
rect anticipations made by Underwood 
and Schulz's subjects for the same syl- 
lable were +.48 for the syllables used 
in Experiment 6; +.64 for the syllables 
used in Experiment 11, and +.75 for 
the syllables used in Experiment 15. 
The last two of these correlations are 
significant beyond the .05 level of con- 
fidence. The responses to this produc- 


TABLE 1 


Mean WORDS PRODUCED FOR 
TRIGRAM SYLLABLES 


Experiment 6 Experiment 11 Experiment 15 
j Mean r Mean j Mean 
fabis | mroduced! Abie | produced! pre | Produced 
ZON 1.27 | PLo 1.93 | ING 4.93 
CYR | 0.87 | viF 0.87 | CED 2.00 
STI 2.53 | ING 4.93 | JAD 1.00 
JOK 2.33 | EST 3.67 | ITE 2.80 
EIG 147 | wae | 2.87 |suL | 2.33 
sou | 2.27 | sou 2.27 |Nuw| 0.40 
VER | 2.53 FRO 2.60 | MEL 247 
YAL 147 | up | 3.00 |poK | 3.13 
CHA 2.67 VER 2.53 | Nor 0.73 
ING 4.93 | cur 3.13 | som 2.87 
ENT 4.33 
JUM 2.53 
ROP 2.07 
ISH 3.40 
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tion technique seems somewhat differ- 
ent than, yet associated with, measures 
of association value, frequency, and 
meaning. The number of words that 
the subjects can produce that contain a 
syllable seems closely associated with 
the rate at which the syllable is learned. 


CONCLUSION 


Underwood and Schulz, defining fre- 
quency as the frequency that trigrams 
appear contiguously in English words, 
found pronunciability to be significantly 
related to rate of learning, while fre- 
quency was not a major variable. This 
experimenter defined frequency as the 
frequency that a trigram appeared as a 
discrete or relatively discrete sound unit 
in spoken English. Using the approach 
taken in this paper, frequency is most 
significantly related to rate of verbal 
learning while pronunciability is not a 
major variable. While it may be that 
other attributes of trigrams (such as 
m or a) which covary with the fre- 


Jonxsox 


quency that trigrams occur as sounds in 
the English language are more basic in 
influencing rate of verbal learning, one 
can say, with certainty, that frequency, 
as defined herein, is an adequate pre- 
dictor of rate of verbal learning. 
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RONALD C. JOHNSON 
University of Hawaii 


CALVIN W. THOMSON 


San Jose State College 


AND GERALD FRINCKE 
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As Stricker (1961) notes, throughout 
our study we (Johnson, Thomson, & 
Frincke, 1960) equate word value with 
the rating of the word on the good-bad 
scale of the semantic differential. He 
says, further, that: 


word value has been used in the literature to 
represent a dimension different from that 
tapped by the evaluative scale of the seman- 
tic differential, and while the two may share 
some commonality, it does not seem entirely 
appropriate to use the latter in order to 
operationally define the former. 


Stricker then demonstrates that word 
value, as defined in previous visual dura- 
tion threshold studies, refers to a moti- 
vational (e.g., Allport-Vernon) and not 
an attitudinal (e.g., semantic differential) 
dimension, and that motivational and at- 
titudinal dimensions of value( ?)or evalu- 
ation( ?) or “X” are probably not synony- 
mous. He says that while we demonstrate 
that there is a relation. between word 
goodness and word frequency (and also 
between both goodness and frequency and 
threshold), we do not demonstrate a rela- 
tion between word frequency and word 
value (as Stricker prefers to define 
value). 

We cannot deny that the operational 
definition of “value” varied between our 
study and that used in previous studies 
conducted in this area. As we see it, 
science consists, to a large degree, of 
figuring out ways of testing the previ- 
ously untestable. If operationally defin- 
ing black as white aids the scientist in 


this endeavor, then it is quite legitimate 
to do so, so long as you make your defi- 
nitions clear. While the word “value” 
has been used in the past to refer to other 
dimensions than those investigated in our 
study, we do not believe that we, or 
others, are bound to traditional usage, so 
long as we define our terms. We plead 
guilty to departing from prior usage of 
the word "value," but do not believe this 
to be reprehensible. 

We believe that our shift in operational 
definition of “value” is defensible on sev- 
eral grounds. The first is that so long 
as value is defined in terms of such di- 
mensions as Allport-Vernon scores, then 
it appears to have been clearly demon- 
strated (Postman & Schneider, 1951; 
Solomon & Howes, 1951) that the two 
variables of word frequency and word 
value are so inextricably intertwined that 
experimental tests of the influence of fre- 
quency and of value on threshold yield 
results that are difficult, if not impossible, 
to interpret in any unequivocal sense. By 
redefining value, we believe that we have 
made the value-frequency problem more 
easily testable than was previously the 
case. If we have, then a redefinition 
seems legitimate. If Stricker prefers to 
use value in another, more traditional 
sense, it is his privilege to do so, and his 
responsibility to design an experiment 
testing the influence of value, as he de- 
fines it, on frequency and on threshold. 

A second defense of our shift in opera- 
tional definition of value also comes from 
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the ground rules of science. | It is gener- 
ally believed that a major aim of science 
is to stimulate further scientific endeavor, 
thus adding to our knowledge. It seems 
likely that many experimenters were dis- 
couraged írom entering the value-fre- 
quency controversy, since after the Post- 
man and Schneider and the Solomon and 
Howes studies cited above it became 
fairly apparent that determining the in- 
fluence of value on threshold was difficult, 
if not impossible, so long as value was 
defined in terms of  Allport-Vernon 
Scores, As a result, little experimentation 
was done in this area, and for a number 
of years the frequency explanation was 
(quite correctly, on the basis of the avail- 
able evidence) generally accepted as 
being the most parsimonious one avail- 
able in dealing with the results of visual 
duration threshold experiments. Recent 
experiments (Johnson, Frincke, & Mar- 
tin, 1961; Newbigging, 1961a, 1961b) 
stimulated, in part, from the study and 
from the definition of value with which 
Stricker finds fault, suggest that fre- 
quency is not a significant variable, while 
value, as we define value, is significant 
(although there is some question as to 
why it is significant) 
threshold. 
We cannot believe that any redefinition 
of word value that serves to increase the 


in determining 


testability of certain experimental ques- 
tions, generates research, and provides us 
with a different and, we believe, more 
correct view of reality, is to be rejected 
merely because it violates traditional 
usage. 
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AN APPLICATION OF LEVINE'S MODEL FOR HYPOTHESIS 
BEHAVIOR TO SERIAL REVERSAL LEARNING* 


BENJAMIN H. PUBOLS, Jr. 


University of Wisconsin? 


Recently, Levine (1959a, 1959b) de- 
scribed a mathematical model for hy- 
pothesis behavior in discrimination learn- 
ing-set experiments. By defining a 
hypothesis (H) as “a specifiable pattern 
of response to a selected stimulus set" 
(1959b, p. 353), and making certain 
assumptions regarding their nature and 
conditions of occurrence, Levine was able 
to evaluate the strengths of various hy- 
potheses (e.g., position preference, stimu- 
lus preference, problem solution) through- 
out learning-set formation, to analyze 
learning-set curves in terms of the con- 
tributions of component Hs, and to syn- 
thesize learning-set surves given H 
strengths. In his dissertation (19592, 
P. 58), Levine suggested that this ap- 
proach might also be appropriate to a 
description of the finding of North 
(1950), Pubols (1957), and others, that 
rats show progressive improvement in 
serial position reversal learning. 

The published model is not directly 
applicable to serial reversal learning 
data, but since its appearance Levine? 
has developed a general model for hy- 
pothesis behavior, applicable to both 
learning-set and serial reversal experi- 
ments, A recent experiment by the 
writer (Pubols, 1962) yielded data in 
terms of which this model might be 
tested. 

In this experiment, three groups of 10 
rats each were given 410 trials in a 


1This investigation was supported by 
PHS Grant M-4046(A), National Institute 
of Mental Health, United States Public 
Health Service. The author would like to 
express his appreciation to Marvin Levine 
and Alfred Etaugh for critical readings of 
Preliminary drafts of this paper. 

* Laboratory of Neurophysiology, Depart- 
ment of Physiology. 
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one-unit alley Y maze. Members o: 
Group 10 received 10 trials per reversal 
for original learning and 40 reversals 
of a position habit; members of Group 
20 received 20 trials per reversal for 
original learning and 20 reversals; and 
members of Group 40 received 40 trials 
per reversal for original learning and 
10 reversals (in all cases, only 10 trials 
were given on the final reversal, as this 
sufficed for measurement purposes). Em- 
pirical results of the study have been 
reported elsewhere (Pubols, 1962). Here, 
we will be concerned with the applica- 
tion of Levine's model to the serial 
reversal data. But first, a brief sum- 
mary of the model seems in order. 


SUMMARY OF Levine’s GENERAL MODEL 


Levine's model is restricted to the 
operation of H behavior over the first 
three trials of a problem. He makes the 
following assumptions: (a) if an H oc- 
curs on a problem, it will be manifest 
over all three trials of that problem ; (b) 
while the determiners of response on 
Trial n may include stimuli associated 
with Trial à — 1 (as in single alterna- 
tion), the response will be independent 
of Trial » —2 stimuli (thus excluding 
from consideration an H such as double 
alternation); (c) all Hs are mutually 
exclusive; (d) the sum of the proba- 
bilities of all operative Hs is one; (c) 
the selection and definition of Hs for a 
given application is independent of the 
characteristics of the model; and (f) 
"a given H has the same strength no 
matter in which cell [sequence] it ap- 
pears" (Levine, 1959b, p. 358). 

The operations involved in estimating 
H strengths differ somewhat for learn- 
ing-set and for serial reversal data, and 
hence these will be considered separately. 
For any three-trial problem in the learn- 
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ing-set situation, there are four reward 
sequences, four response sequences, and 
two conditions of Trial 1 outcome (re- 
warded or nonrewarded). This yields 
32 possible sequences, or cells. The pro- 
cedure for evaluating H strengths basi- 
cally involves two steps. The first step 
utilizes a simplifying assumption, which 
may be called the assumption of sym- 
metry: in a learning-set experiment, 
each problem involves a new pair of 
discriminanda, and hence it is reasonable 
to assume that, on each problem, the 
probability of a Trial 1 positive sequence 
equals the probability of a Trial 1 nega- 
tive sequence: 


P(+1)=P(-) 


This being the case, the first step in- 
volves the calculation of conditional 
probabilities of empirical sequences, the 
probability of the empirical outcome on 
Trials 2 and 3, given the reward se- 
quence and outcome on Trial 1, Opera- 
tionally, the frequency of Occurrence of 
a given sequence is divided by the fre- 
quency of occurrence of all sequences 
having the same reward sequence and 
Trial 1 outcome. The second step then 
involves the simultaneous solution of N 
equations, representing the NV hypotheses 
whose strengths are being evaluated, 
taking into account differences in the 
number of cells associated with each H. 
The final result, then, is a st 
the probability of each H in 
of problems analyzed. 

Turning now to seri 
the total number of 
from 32 to 8. This follows from the 
fact that there is only one reward se- 
quence involved; that is, on a given 
reversal, reward is always on the same 
side or position, Further, Tesponse se- 
quence and reward outcome become 
completely confounded, Defined in terms 
of reward outcomes, the eight serial re- 
versal sequences are, for Trial 1 nega- 
tive: ———, Eus —, and 
=F; for Trial j positive: +++, 
Tc, +— +, and +--, 

In the case of serial reversal (and 
also, incidentally, oddity learning) ex. 


atement of 
each block 


al reversal data, 
Sequences reduces 


periments, the assumption of symmetry 
is not met. As serial reversal learning 
progresses, the probability of a Trial 1 
negative sequence increases while that 
of a Trial 1 positive sequence decreases ; 
hence, an H will come to have a greater 
strength in the former case. When the 
assumption of Symmetry is not met, 
Levine* has shown that the use of con- 
ditional probabilities of empirical out- 
comes is not appropriate; rather absolute 
probabilities should be calculated. 

Thus, for the evaluation of serial re- 
versal data, absolute probabilities of 
Sequence occurrence should be utilized. 
The absolute probability of a given se- 


quence is defined as the ratio of the fre- 


quency of that Sequence to the frequency 
of all sequences: 


P (ith sequence) 


— frequency of ith sequence ] 
frequency of all sequences 


APPLICATION oF THE GENERAL MODEL 


The analysis of H 
data of the Pubols ex 


» yielding 50 Scores per 
block (5 reversals x 10 


gives, for Groups 10, 20 and 40, re. 
Spectively, eight, four, and two blocks 
for analysis, Table 1 outlines the hy- 
potheses chosen for analysis, along with 


other relevant information, Equation 1 
now becomes: 


P(ith sequence) 


Li frequency of ith 


sequence 
m 
After these Sequence probabilities are 
computed, y, a, 


4.5 usual way, and the final 
Probabilities are defined as R=&, 


2 m 
~<a, and B= 2p, P» and p, remaining 
unchanged, 


Results of the 


application of the model 
are shown in Fi 


gures 1 and 2, Figure 1 
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TABLE 1 


THE HYPOTHESES EVALUATED IN THE PRESENT ANALYSIS 


Hypothesis Ed Associated sequences NU a 
Random responding R All sequences 8 
Position preference A aay [Ii 2 
Position alternation B —-+-,+-+ 2 
Trial 2 problem solution ba Se 1 
Trial 3 problem solution Ps Sat 1 


Presents the experimental data as ana- 
lyzed into the various Hs. Inspection 
of the figure reveals an interesting hap- 
pening. Whereas all three groups show 
Interreversal learning (progressive in- 
crease in po+ pg), this learning is cor- 
related, in different groups, with the 
diminution of different “incorrect” Hs. 
1 hus, for Group 10, interreversal learn- 
Ing is accompanied by a systematic de- 
crease in position alternation, B; for 
Group 20, there is a decrease primarily 
in random responding, R; and for Group 
40, it is position preference, A, which 
Systematically decreases. 

In order to obtain some indication of 


PROBLEM SOLUTION (p,+p,) 


d 


POSITION PREFERENCE (A) 


| g $ 45 6 wv is 


the reliability of the trends just de- 
scribed, each curve of Figure 1 was sub- 
jected to an individual trend test For 
these tests, H strengths in each rat on 
each block of reversals were computed, 
and the error term was the blocks by 
subjects interaction. As expected, in 
Group 10, the trend in the problem solu- 
tion curve was significant (p < .01), as 
was the trend in the position alternation 
curve (p < .001). For Group 20, only 


5 Because of different numbers of reversal 
blocks in the three groups, and because 
within an individual subject all hypotheses 
are not statistically independent, it was not 
possible to run any over-all trend tests. 


e—o GROUP IO 
o—o GROUP 20 
4——^ GROUP 40 


NI. 


A—A 
RANDOM RESPONDING (R) 


POSITION ALTERNATION (B) 
Dog 41S: € x 8 


REVERSAL BLOCK 


Fic. 1. 


Hypothesis probabilities as a function of reversal block. 
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GROUP 20 GROUP 40 


4 l2 


ial reversal functions, in terms of 


Trial 2 percentage correct. 


the trend in the problem solution curve 
was significant (p < .05), and for Group 
40, only the trend in the position prefer- 
ence curve was significant (p < .05). 
The failure of significance in the case 
of the Group 20 random responding, and 
the Group 40 problem solution curves 
can probably be attributed to the loss in 
degrees of freedom resulting from the 
combination of reversals into blocks. 
Certainly, in the case of Group 40, there 
is independent evidence (Pubols, 1962) 
that interreversal learning was genuine, 

It seems that, as the number of trials 
per reversal increases, the initially 
dominant hypothesis changes from posi- 
tion alternation, possibly to random re- 
sponding, and then to position preference, 
The strengths of these Hs were, in the 
indicated groups, high from the very 
first reversal, suggesting the importance 
of the number of trials on the original 
problem. In the case of Group 40, the 
position preference hypothesis manifests 
itself exclusively in the sequence, — — — 
which indicates Strong perseveration of 
the formerly correct response. — With 
fewer trials per reversal, this persevera- 
tive effect is less strong and incorrect 
responding is manifest in an alternation 
or a random pattern, 

Figure 2 compares conventional (em- 
pirical) serial reversal curves based on 
Trial 2 performance with curves syn- 
thesized according to Levine's Equation 


6 (1959b, p. 363). The fit is seen to 
be best for Group 10, poorest for Group 
40. Trial3 curves were also synthesized, 
but the fit was less close in all three 
cases, primarily by way of overestimation. 


Discussion 


The general model appears to have 
as much utility when applied to serial 
reversal learning as when applied to 
learning-set formation, Where the prob- 
ability of a Trial 1 negative sequence 
equals the probability of a Trial 1 posi- 
tive sequence, the Strength of Hs may be 
evaluated in terms of either absolute or 
conditional probabilities of empirical out- 
comes. In contrast, where Trial 1 posi- 
tive and negative sequences have un- 
equal probabilities, the strength of Hs is 
appropriately evaluated only in terms of 
absolute Probabilities, 

There yet remains one 
between the 
data, 


discrepancy 
Serial reversal 


e in which it appears. 
"IS assumption implies, for example, 
that Pie =P Chk +), and that 
I PSP epe That these 
equalities do not hold follows from the 
fact that P(—)»P(4). However, 
the analytical utility of the model, as 
indicated by Figure 1, and the close 
Correspondence between empirical and 
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synthesized functions, as indicated by 
Figure 2, suggest that, in practice, this 
may not be a serious shortcoming. 

Levine (1959b, pp. 305-366) compares 
explanations of “learning to learn” that 
have been suggested in the literature. 
Harlow's view (1959; see also Harlow 
& Hicks, 1957) is that interproblem (and 
presumably interreversal) learning in- 
volves the weakening and eventual elimi- 
nation of incorrect response tendencies, 
or, in the present context, Hs not con- 
Sistently reinforced. An opposing view 
(Restle, 1958) emphasizes the strength- 
ening of rewarded responses (Hs). 
While our data show that increases in 
P» and pa are accompanied by decreases 
in the strength of other Hs, it cannot be 
concluded, as Harlow implies, that the 
latter is the cause of the former. Nor, 
Of course, can the reverse be concluded. 
We are in agreement with Levine 
(1959b, pp. 365-366) that, in terms of 
the model for hypothesis behavior, learn- 
ing to learn is best described as the 
Strengthening of certain Hs through 
100% reinforcement and the weakening 
(extinction) of other Hs through 50% 
reinforcement. 
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A COMPARISON OF THE POWER OF THE U AND t TESTS! 


C. ALAN BONEAU 
Duke University 


In a recent paper (Boneau, 1960), the 
author summarized the results of a num- 
ber of theoretical and empirical studies 
dealing with the effects of violations of 
assumptions underlying the ¢ test. It was 
concluded that the ¢ test is remarkably 
unaffected by the two common violations : 
sampling from populations having un- 
equal variances and sampling from non- 
normal distributions. One who uses the 
t test can be reasonably sure that the 
probability of rejecting a true null hy- 
pothesis is close to the alpha value he 
selects for his experiment even though he 
may have misgivings about the assump- 
tions upon which the t test is based. As 
a result of these considerations, a’ recom- 
mendation was made in the previous 
paper to the effect that even when the 
assumptions are not met (except under 
special conditions) the f test and the F 
test on means of the analysis of variance 
be used without those attendant feelings 
of turpitude which can be attributed to 
an introjection of the strictures of the 
proponents of nonparametric methods. 

This recommendation was based upon 
an assumption that the ¢ and F tests, be- 
cause they make effective use of the in- 
formation in the sample and have other 
desirable properties, should be more pow- 
erful techniques than nonparametric com- 
petitors. (For readable explanations and 
discussions of the power of a test see 
among others Siegel, 1956 or Walker & 
Lev, 1953.) That is to say, if the null 
hypothesis is false—if there are true dif- 
ferences between means—the t test should 
signal the detection of small differences 
by yielding significant results more fre- 
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quently than should comparable non- 
parametric methods. 

Theoretically, the ¢ test is more power- 
ful than any of the usually utilized tests 
when the assumptions underlying it are 
met. It is also true, however, that when 
sampling is from certain nonnormal distri- 
butions, other tests may be more powerful 
than the £ test. The Wilcoxon-Mann- 
Whitney U test (Wilcoxon, 1945; Mann 
& Whitney, 1947) for example, in one 
pathological case, theoretically is infi- 
nitely more powerful than the ¢ test. 
Theoretically also, the power of the U 
test is never less than .83 of that of the 
t test (Hodges & Lehman, 1956). In fact 
even in the case for which the t test is 
designed (normality and equal variances) 
the U test by one measure of the relative 
power of the two tests is 95% as power- 
ful as the ¢ test (Hodges & Lehman, 
1956). 

Such theoretical statements about rela- 
tive power of tests are based upon mathe- 
matical limiting processes involving con- 
ditions which are not representative of 
most practical situations, for example, 
infinitely large sample sizes and 


arbi- 
trarily small differences between popula 
ton means: (Dixon, 1954; Hodges & 


Lehman, 1956; and Mood, 1954). State. 
ments as to the relativ 
eral of various nonpar; 
of the t test are Scatt 
literature (Dixon, 
man, 1956; Lehn: 
to mention 
ones), The 
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ametric competitors 
ered throughout the 
1954; Hodges & Leh- 


ologist musters in an atten 
truth from chaff, 

The present paper is intended to pre- 
Sent the facts (culled from the liter. 
as well as manufactured for the 
about the power of the ¢ test 
ticular, how that power 
the power of a specific 


apt to wrest 


ature 
purpose) 
and, in par- 
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competitor in various practical situations. 
This presentation is meant to temper in 
part the implications of the previous 
paper (Boneau, 1960) that the ¢ test 
should be used whenever possible. It 
would seem that here, as in other areas 
of human endeavor, a little discretion 
may pay off. 

Attention will be focused upon the U 
test, a worthy protagonist whose princi- 
pal strengths vis-à-vis the f test have al- 
ready been mentioned. The U test, or 
equivalent versions of procedures based 
upon ranked scores, has been invented 
Several times in the history of statistics, 
first by Deuchler (1914), but later by 
Wilcoxon (1945), Mann and Whitney 
(1947), among others. (See Kruskal, 
1957, for historical discussion.) As used 
in the present context, the statistic U is 
computed by determining the number of 
Scores in the second sample which are 
exceeded by each score in the first sam- 
ple. The sum of all such counts summed 
Over the scores in the first sample is 
called U, tables for which have been de- 
borra by Mann and Whitney (1947), 
EX ended by Auble (1953), and made 
readily accessible by Siegel (1956). Wil- 
ae T test (1945), although limited 
o equal-size samples, gives exactly 
equivalent results even though it is com- 
Puted in a different fashion. 

_ The null distribution of U may be de- 
rived from the assumption that ranks are 
assigned to the two samples on a random 
basis such that every combination of 
peaks among samples is equally likely. 
“or example, given that 7, the first sam- 
ple size, is 2 and m, is 3, the possible 
values of the two ranks in the first sam- 
ple (assuming no ties) are 1 and 2, 1 and 
3, 1 and 4, 1 and 5, 2 and 3, 2 and 4, 2 
and 5, 3 and 4, 3 and 5, and finally, 4 and 
5. By the definition above, these lead to 
U values of 6, 5, 4, 3, 4, 3,2, 2; 1, and; 0, 
respectively. If all of these combinations 
are equally likeiy—the two samples came 
um the same distribution for example— 

€ probability of getting values as ex- 
eie as 6 or 0 by chance is the sum of 

"e individual probabilities : 


1/10 + 1/10 = 1/ 


By the logic of statistical decisions, how- 
ever, one attributes extreme values of U 
not to these accidents of random sampling 
which occur with known probability if 
the null hypothesis is true, but to actual 
differences between the distributions. In 
actuality, differences in distribution usu- 
ally lead to non-equally-likely combina- 
tions of ranks. Ranks 1 and 2 occur 
together with relatively greater frequency 
as the difference between the means of 
the populations increases and as a result 
more frequent extreme U values occur. 
For this reason U is generally considered 
to be a test of displacement or shift of 
distributions, the main focus being on 
differences in central tendency. Note 
also that discrepancies of variance tend 
also to produce non-equally-like combi- 
nations of ranks. If two populations 
have the same mean, the values 1 and 5 
tend more to occur together as the size 
of the variance of the first increases rela- 
tive to the size of that of the second. 
However, these combinations give rise to 
middling values of U and hence are ig- 
nored, in effect, by the U test. We shall 
see, however, that there are cases for 
which differences in distribution other 
than central tendency affect the value 
of U. 

One further statement might be made 
about the sensitivity of the U test in spe- 
cific cases. Since essentially it is based 
only upon ranks, first and last scores get 
rank 1 and s whether they are close to 
the mean or several standard deviations 
away. Thus the occasional score which 
is apparently not in the distribution but 
which furnishes no real justification for 
exclusion is treated as a member of the 
ingroup by the U test but as the pariah 
it may well be by the t. 

The method of the present paper is 
quite similar to the approach followed by 
the author in the previous paper dealing 
with the probability of rejecting the null 
hypothesis if it is true. In that study 
populations having specified characteris- 
tics were constructed and the values of 
t arising from the differences between 
means of random samples drawn from 
them were computed. The empirical 
probability of rejecting the null hypothe- 
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is was obtained by determining the pro- 
ben of sample f£ values falling outside 
the ordinary tabled values for the appro- 
priate number of degrees of freedom (i.e., 
falling in the region of rejection). Since 
the null hypothesis was indeed true, these 
empirical probabilities or proportions 
could be compared with the nominal val- 
ues to determine the effects of modifying 
the specified characteristics of the popu- 
lations in such a way as to violate the 
assumptions underlying the ¢ test. In the 
present study, the concern is with the 
proportion of obtained f’s and Us falling 
in the region of rejection (or critical re- 
gion) when the null hypothesis is false— 
when there is a built-in, specified differ- 
ence between means. To generate this 
information, the only required addition 
to the previous ¢ test program is a pro- 
vision for changing the mean of the first 
sample to any value desired. 

The program using the IBM 650 Elec- 
tronic Data Process System for generat- 
ing /'s from random deviates was dis- 
cussed in detail in the previous paper 
(Boneau, 1960). To summarize briefly, 
10-digit random numbers were generated 
by a multiplication process. These were 
converted into random deviates from a 
specified population by a table-look-up 
procedure; the random deviates were in- 
jected into the computing formula for 
the ¢ test for the difference between 
means of two independent samples; and 
the resulting £ value was sorted and 
tallied on an internally contained table 
within the computer. 

The program designed for the present 
study for the U test utilized the existing 
random number and random deviate gen- 
erating procedures. The value of Ọ was 
computed by the simple expedient of sub- 
tracting every score in the first sample 
from every score in the second sample 
and counting the number of minus signs 
which resulted. This number is U by 
definition. The possibility for a tie was 
ignored since the expected rate of ties 
was approximately one per thousand Us, 

a rate which, while significant, would 
seem to have little effect on the observed 
results. As in the ¢ program, the obtained 
U values were sorted and tallied on an 


internally contained table which was 
punched out in card from when the de- 
sired number of Us from the specified 
populations was reached. 


REsuLts 


Since the results of the study will be 
expressed in terms of empirical power 
functions, the investment of a small 
amount of space to elaborate on the 
method of their determination may be in 
order at this point. 

For any given set of conditions, i.e., 
combination of means, variances, and dis- 
tributions, the result of the computer pro- 
cedure is a set of ts or Us which may 
be arranged in a frequency distribution. 
Figure 1 shows two such distributions. 
The distributions are of ?’s obtained on 
the basis of sample sizes of 15 from nor- 
mal populations having a variance of 1.0. 
One of the distributions shown, that cen- 
tered around the ¢ value of zero, arose 
when the means of the two samples were 
both equal to zero, For the other distri- 
bution, that centered around 2.8 and 
shaded, the difference between the means 
of the two samples was equal to 1.0, that 
is, equal to one standard deviation, The 
vertical lines divide the Tange of possi- 
ble t values into two regions. The un- 
hatched region marks off those values of 
t which result in a decision to accept (or 
fail to reject) the null hypothesis at the 
05 level. The hatched regions, those 
values of ¢ which result in a decision to 
reject the null hypothesis at that level. 
As can be seen, when the null hypothesis 
1s true most of the obtained ?’s fall in the 
region of acceptance. The proportion 
which fall in the region of rejection, in 


this case 049, we have called the empiri- 
cal alpha level, 


Sometimes a 


decision is made to accept 
a false null h 


hypothesis, the So-called beta 
igure 1, such an error would 
the case of those values of t 
haded distribution which fall 
n of acceptance. On the other 
hand, decisions to reject the null hypothe- 
Sis when it is indeed false occur for th 
Proportion of the f's from the shaded dis- 
tribution Which fall in the region of re- 
Jection, in this case ,737, It is this pro- 


in the regio 


p 
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Fic. 1. Empirical ¢ distribution for mean difference of (a) 0.00 (unshaded area) and 


(b) 1.00 (shaded area) for sample sizes of 15. 


tion for the .05 level of significance.) 


an which we call the empirical power 
ilio ve under the given conditions. It 
Mi te be noted that both the empirical 
bse and the empirical power are esti- 
Wi tes of the exact theoretical values 
hich obtain for these conditions. 
Ms data of the present study are 
fu pirical power values considered as 
nctions of the actual difference between 
ari means, We shall consider 
: parately, the functions for the one- and 
Wo-tailed tests and for the 05 and .01 
veies of alpha. The figures which will 
pe presented will, for a given set of con- 
ditions, depict such functions for both 
that t and U tests so that visual com- 
Parisons may be made. 

One further thing should be said about 
nomenclature before results are presented. 
As in the previous paper, the conditions 
of sampling will be symbolically repre- 
Sented. For example, N(2,1)5-N (0,1) 15 
indicates that the first sample is from a 
normal population N with a mean of 2 
and a variance of 1, the sample size being 
k In this instance, the second sample 
E i a normal distribution having a 
E of 0, a variance of 1, and the sample 
will is 15, In the text and figures, we 

use the variable “x” for the value of 


(The hatched area is the region of rejec- 


the first mean to indicate that it takes on 
the several values necessary for the 
p, — p, values on the abscissa. In all 
cases the value of the mean of the second 
distribution is zero. One thousand Us 
and t's were obtained for each condition. 


Normal. Distribution: Homogencous 
Variances 

First to be considered are the cases 
in which the sizes of both samples are 
the same. Figure 2 depicts the vari- 
ous empirical power functions obtained 
when the condition of sampling is 
N(x,1)5-N(0,1)5. Because of the dis- 
crete nature of the U distribution, exact 
alpha values of .05 and .01 are not pos- 
sible. When both samples sizes are 5, the 
following alpha values obtained from 
tables are used: .056 for the two-tailed 
test and .048 for the one-tailed test in 
place of .05, and .008 for both the one- 
and two-tailed test in place of .01. 

A number of interesting facts can be 
observed in Figure 2. First is the ex- 
pected superiority of the one-tailed test 
to the two-tailed test. Of prime impor- 
tance for the present paper is the remark- 
able lack of difference in power of the 
t and U tests over the range of values 


Fic. 2. Empirical power functions for U 
and ¢ tests with sampling scheme N(x,1)5- 
N(0,1)5. 


presented. Only in the case of the two- 
tailed at the .008 level does the ¢ test 
seem to exhibit a definite superiority and 
here only for gross difference between 
the means. In most of the other cases, 
the differences between the obtained 
points are well within the margin of 
sampling error (i.e., not significantly dif- 
ferent). This would indicate that al- 
though the ¢ test is theoretically a uni- 
formly (over all mean differences) most 
powerful test, the margin over the U test 
is not very much. Because of this power 
property of the ¢ test under these condi- 
tions, the points at which the U test 
shows a Superiority must be attributed 
to sampling error. It is also clear that 
the alpha values (i — pe —0) are vir- 
tually identical and are approximately the 
theoretical values expected, 

Increasing both sample sizes from 5 to 
15 tends, as shown in Figure 3, to in. 
crease the power of the test, but to leave 
virtually unaffected the things which we 
have said about the smaller sizes, For 
example, at the .05 level for a one-tailed 
test, the power of both tests for a differ- 
ence between means of 1.0 is around .42 
for sample sizes of 5, and .85 for samples 
of size 15, 

Still maintaining the condition of sam- 
pling from a normal distribution with 
equal variances of the parent populations, 
we may allow the sample sizes to be dif- 
ferent, 5 as opposed to 15, and generate 
another series of curves, Figure 4. The 
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Fic. 3. Empirical power functions for U 


and ¢ tests for sampling scheme N (x,1)15- 
N(0,1)15. 


results are similar to those preceding, 
with the power for the one-tailed test at 
the .05 level and a mean difference of 1.0 
being in this case about .60, This, too, 
is to be expected since the power of the 
test among other things is an increasing 
function of the dj between the 
means but a decreasing function of the 
expected standard error of the difference 
between means where 


[UE Yon, + oos*/n, 
Figure 4 shows more clearly than 
the preceding graphs the Superiority of 
the ¢ test to the U test, but the largest 
obtained difference is only .12, and being 
the largest probably Overestimates the 
true difference, 
The results we have observed thus far 
are for conditions in which the 


any of 


assump- 
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tions underlying the / test are satisfied. 
Under these conditions we know that 
theoretically the ¢ test is the most power- 
ful test of the difference between two 
means. The power functions and rela- 
tions between the powers of the two tests 
obtained in this study are comparable 
with those found in several sources 
(Dixon, 1954; Dixon & Massey, 1957; 
Ferris, Grubbs, & Weaver, 1946, to name 
a few). Thus they contribute only a con- 
firmation of the general method and at 
the same time furnish a graphical demon- 
ian of the fact that, while less power- 
ul than the ¢ test, the U test performs 
2 well in those situations for which 
the ¢ test is expressly suited. 
Mia the assumptions of the ! test 
m 2 fulfilled, it is not necessarily a 
m powerful test. Moreover, as stated 
fe ler, there are theoretical conditions 
Or which the ¢ test is considerably less 
Powerful than other tests including the 
i Fini The remainder of this paper will 
munia power functions for the U and t 
whid or those violations of assumptions 
ilis a arise from various combinations of 
a e distributions, the two variances, 
lect two sample sizes which have been 
= ded for the study. We will determine 
Er the power functions for the ¢ test 
any drastic deviations from those 
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power functions we have already seen, 
and we shall discover those cases, if any, 
for which the U test performs better than 
does the ż test. This, of course, must be 
within the limitations imposed by the 
selection of conditions. 


Normal Distributions: Heterogencous 
Variances 


Initially, we shall proceed by examin- 
ing the effect of violating the assumption 
of homogeneity of variances with normal 
distribution. In the previous study, it 
was determined that inequality of vari- 
ances up to at least a ratio of 1 to 4 pro- 
duced a very little effect in alpha provided 
the sizes of the two samples are the same. 
If the sample sizes are different, gross 
disturbances in alpha occur. 

Figure 5, depicting the equal-sample- 
size case (n= 5), reveals that for ¢ the 
alpha is relatively undisturbed and power 
also seems little affected. It should be 
noted that the relatively low power for 
Figure 5 (and for some following fig- 
ures) for a given mean difference is to 
be attributed to a change in the standard 
error of the difference between means 
occasioned by the change in g, from 1 
to 4. For a given mean difference in 
standard error units, the power remains 
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Fic, 5, Empirical power functions for U and ! tests for sampling scheme N (x,1)5-N (0,4)5. 
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ially the same. For example, the 
veia Ra error of the difference be- 
tween means for the case N(x,1)5- 
N(04)5 is .63, while for the case 
(x,1)5-N(0,4)5 it is 1.0. A mean dit- 
ference of 2.0 (2 standard errors) in Fig- 
ure 5 shows the power of the two-tailed 
t test at the .056 level to be approximately 
45. On Figure 2, a mean difference of 
1.26 (2 standard errors) shows the com- 
parable power to be approximately 144, 
In terms of the true standard error of the 
difference between sample means, then, 
the power of the t test seems relatively 
unaffected by violating the homogeneity 
of variances assumption, 

Likewise, the power of the U test is 
maintained, but at a level again slightly 
less powerful than the ¢ test. Similar 
results (not shown) obtain when both 
sample sizes are changed to 15. 

Introducing heterogeneity of sample 
size as well as heterogeneity of variance 
produces discrepancies which might be 
predicted from what is already known 
about the effect of this condition on the 
alpha level, Figures 6 and 7 portray 
these effects for the two possible com- 
binations of variance and sampe size 
considered, N(x,1 )5-N(0,4)15 and 
N (x,4)5-N (0,1)15 respectively. The 
first thing to observe in these figures is 
that effect on the alpha level (the power 
for a mean difference of zero). In Fig- 
ure 6 the alpha is for all curves less than 


Fic. 6. Empirical power functions for U 


and / tests for sampling scheme N(x,1)5- 
N(0,4) 15, 
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Fic. 7. Empirical power functions for U 


and ! tests for sampling scheme N(x,4)5- 
N (0,1) 15. 


the nominal values of .05 and .01, al- 
though the magnitude of the difference 
for the U curves is less than those for t. 
The actual values observed are as fol- 
lows: (one-tailed test-.05 level) ¢= .010; 
U=.029; (two-tailed test-05 level) ¢ 
= .009; U=.027; (one-tailed test-.01 
level) 1—.002; U= 005; (two-tailed 
test-.01 level) 1—.0001; U= 004. The 
power functions reflect the fact that while 
nominally at a ,05 level, the tests are 
actually operating at a reduced alpha 
value. At the reference value, a mean 
difference equal to two Standard errors, 
the power of the two-tailed :05-level t test 
is approximately .24, while the power for 
the comparable U test is about .28. Both 
of these values are much less than the 
values of about .45 which obtained under 
the other conditions, We may conclude 
that the conditions of heterogeneity of 
variances and sample sizes which pro- 
duced Figure 6 have affected the alpha 


level of both U and £, U to a lesser ex- 
tent, however, than t. 


level for the U 


resulting pow 
reflect a supe 
are observin 
for an alph 

of abou 


example, were con- 


r e basis of the normal bound- 
aries of the region of rejection for the 
.05 level of alpha. 


We may make exactly the same com- 
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_ parisons for the results depicted in Fig- 


ure 7, but note that the effects are in the 
opposite direction to those found in Fig- 
ure 6. For example, the alpha values 
obtained are all greater than the nominal 
values, and again the distortion to the 
t values is greater than those for U. For 
this case—N (x,4)5-N (0,1) 15—the actual 
alpha values observed are: (one-tailed 
test-05 level) t —.115; U=.081; (two- 
tailed test-,05 level) ? —.145; U = .064; 
(one-tailed test-.01 level) t= .048; L 
.020; (two-tailed test-.01 level) ! = .058; 
U = 021. Considering the reference 
mor used earlier, the empirical power of 
Lm two-tailed test with alpha equal to .05 
s a difference between means equal to 

© standard errors of the difference be- 
2m means, we find these values to be 
de for t and .52 for U. And as before, 
ü power curves behave as if they were 
E no curves for the observed value 
a aipha, the curves for ft being consid- 
rably above those for U at all points. 
Seis may conclude that violating the 
irem. DO of homogeneity of variance 
epa underlying distributions are normal 
S ittle effect on either the alpha level 
au m power of the ¢ or the U test as long 
im err Pi are the same. The viola- 
B of this assumption coupled with het- 
m £i hein sample sizes changes the 
RUA a level of both the ¢ and the U tests 
pre „Produces power functions which 
m ningly are roughly appropriate for the 
dc alpha level rather than the nominal 
GE The U test seems much less dis- 
bs ded by this particular violation than 
that the ¢ test, but it is by no means true 
m the U test is completely unaffected 
aS would seem to be implied by the term 
Nonparametric.” Rather it seems to be- 
have much as does the ¢ test, but some- 
what less sensitively to the violation of 
the assumption of homogeneity. 


Nonnormal Distributions 


At this point we will examine the em- 
Pirical power functions for the t and the 
eos when sampling for at least one 

n ie is from nonnormal distributions. 
S this way we will observe the effects, 

any, on the functions if one or both of 


the parent populations is exponential, or 
if one or both of the parent populations is 
rectangular. From an examination of 
the empirical alpha for the ż test, we al- 
ready know what the effects of sampling 
from a rectangular distribution are mini- 
mal on alpha. When the exponential dis- 
tribution is involved, however, some per- 
turbations in alpha may occur because of 
differences in skewness of the two distri- 
butions. The magnitude of these dis- 
turbances as seen in the earlier study was 
progressively lessened as sample size 
increased. At sample sizes of 25, the 
effect was virtually unnoticeable. 

These earlier observations are con- 
firmed in this study. Figure 8, for ex- 
ample, depicts the results of sampling 
from two rectangular distributions— 
R(x,1)5-R(0,1)5. Alpha is approxi- 
mately the correct value for both levels 
for both tests. In this figure, it appears 
that the power of ¢ is quite generally 
greater than that for U, but never by 
much except for the .01 level. Indeed, 
it would seem that for small differences 
between means, say, 0.5, U may be supe- 
rior to /. Similar results (not shown) 


were obtained when both sample sizes 
were increased to 15. 

The next distribution to be examined 
occurs when both samples are taken from 
exponential distributions as in Figure 
9—E(x,1)5-E(0,1)5. The 


values are in the 


empirical 


alpha appropriate 


POWER 


Fic. 8. Empirical power functions for U 
and £ tests for sampling scheme R(x,1)5- 
R(0,1)5. 
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Fic. 9. Empirical power functions for U 
and £ tests for sampling scheme E(x,1)5- 
E(0,1)5. 


ranges. Here again, for small differ- 
ences between the means, the U test 
seems consistently more powerful than 
the ¢ test, but this advantage disappears 
with greater mean differences. All in all 
one may conjecture that when both dis- 
tributions have the same Shape, even 
though not norma], the power functions 
of the ¢ and U tests have a relatively 
constant relationship. In most instances, 
the t test is slightly more powerful than 
the U test, as we have observed in most 
of our examples, 

All of the foregoing power functions 
have come from combinations of distribu- 
tions which produce essentially symmetric 
distribution of t and U when the null hy- 
pothesis is true (zero difference between 
means). We might expect more severe 
disturbances in Power functions in those 
cases which, because of basic asymmetries 
in the observed ¢ distributions, exhibited 
discordant obtained alpha values in the 
previous study. As may be recalled, the 
asymmetric distributions arose when the 
two parent populations differed in skew, 
one normal, the other exponential, for 
example. It was true, however, that in- 
creasing the sample size greatly dimin- 
ished the asymmetries since the under- 
lying probability mechanisms tend 
normalize the distributio 
size increases, 


to 
n of ¢ as sample 


Tt is such conditions, probably not too 
uncommon in the experience (or at least 
the imagination) of the research worker, 
which motivate a desire to seek Statistical 


tools which exhibit fewer allergic reac- 
tions to violations of assumptions. 
Because of these considerations, it is 
interesting to compare the power func- 
tions of the ¢ and the U test under such 
conditions of sampling, Figure 10 makes 
the comparisons for the case of exponen- 
tial and normal distributions with samples 
of size 5—E(x,1)5-N(0,1)5, First to be 
noted are the discrepancies in the ob- 
tained alpha values for the one-tailed 
tests due to the asymmetrical distribution 
of t and, surprisingly enough, for U. 
The .048-one-tailed values are .022 for 
U, .014 for t Likewise, the .008-one- 
tailed values are .002 for U, .001 for t. 
This is to be contrasted with the two- 
tailed values which 
to the nominal values—the .056-two-tailed 
value for U is 049, for t it is .068 
the .008-two.t: 
for t, .011. 


ing alpha, / and U both are affected by 
sampling from Populations which differ 
in skew, although it is Possible that the 
effect on U is less than that on ¢, 

A further examination of Figure 10 
reveals that the empirical power functions 
or U are, with one or two exceptions, 
higher than the power functions for t 
but the advantage is slight. 


As mentioned previously, 


increasing 
the sample size tends to norr 


malize the 


POWER 


an 10. Empirical Power functions for U 
and ¢ tests for sampling schemi E(x,1)5- 
N(01)5. Ais 
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Fic. 11, Empirical power functions for U 


ope for sampling scheme E(x,1)15- 


distribution of t in the null case, the effect 
and 8 T lessen the discrepancy of nominal 
m Obtained values for the one-tailed 
i i Figure 11 presents the curves 
E(x uw increased sample size case— 
eum )15-N (0,1)15. As before, our at- 
differs is first directed to the zero-mean- 
ass E points, and as expected, the 
aa es for t have become closer to the 
minal values—one-tailed test, .037 for 
ee and .006 for .01 level; two- 
test, .044 for .05 level and .012 for 
level. Increasing the sample size to 
pro however, does not seem to im- 
TAN the performance of the U test. For 
poene ea U test, the obtained alpha 
6 es are 016 for the .05 level, and .000 
E the 01 level, For the two-tailed test, 
ned alpha values again seem to 
s higher than the nominal values—.068 
T the .05 level, and .019 for the .01 level. 
^ Figure 11, we note that the power 
sanctions for the U test are almost in- 
riably below those for the ¢ test. 
notable phenomenon occurs for the 
i level, two-tailed U test. This particu- 
* curve shows a decrease in power from 
Mean difference to 0.25 before starting 
yg Ein. While sampling error may 
ae „account for the dip, it is certainly 
ge ble for a test to be "biased," the 
Nical term for such an occurrence. 
th Act there is no reason to believe that 
test as a test for mean differences 


sk 
Ould not be biased. The U test is only 
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fortuitously a test for mean differences, 
being fundamentally a test for differences 
between distributions. As we have seen 
it is less sensitive to some kinds of differ- 
ences between distributions than others, 
being perhaps maximally sensitive to dif- 
ferences in central tendency of two dis- 
tributions. But there are many measures 
of central tendency, the mean and the 
median, for example. We know that the 
mean and median are different in skewed 
distributions. In the combination of dis- 
tributions with which Figure 11 is con- 
cerned, namely, exponential and normal, 
it is possible for medians to be the same 
ior the two distributions which necessi- 
tates that the means be different and vice 
versa. For the ¢ test which explicity 
evaluates differences in sample means this 
seems to present no problem. The U test, 
however, may well be more sensitive to 
median differences and thus show the bias 
when used as a test for differences be- 
tween means. 

Other combinations of distributions 
which were explored in no way change 
the general picture which we have con- 
tinually observed. We may summarize 
that picture as follows: In general the 
i test is more powerful than the U test, 
but never by much. Based on the evi- 
dence we have seen, one might conjecture 
that over a long series of experiments 
involving distributions of the kind we 
have used, a consistent use of the ¢ test 
might result in, say, 5% more rejections 
of a false null hypothesis than would a 
consistent application of the U test. There 
are many other kinds of distributions 
arising in research, however, for which 
this statement need not apply. Depending 
upon one's inclinations, such a conclusion 
could be interpreted as ample grounds 
for habitual use of either of the two tests. 
There are, as we have seen, other consid- 
erations. As a general rule we might 
say that the ¢ test seems to provide the 
appropriate power curve for the actual 
alpha level involved, whereas, the U test 
shows more variability in its power func- 
tions. That is to say that if the actual 
(not the nominal) alpha is .05, the power 
curve for the ¢ test in most cases is prob- 
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ably very similar to the power function 
for the theoretical case for the .05 level; 
it is also to say that the statement is not 
nearly so true of the U test. This prop- 
erty of the ¢ test is useless, however, 
unless we know what the actual alpha is. 
In cases when assumptions are violated 
we will not. It is true that the violations 
of the assumptions underlying the ¢ test 
can produce large discrepancies between 
nominal and actual alpha when the sam- 
ple sizes are other than nearly equal. 
This in itself contraindicates the use of 
the ¢ test in these instances. A further 
somewhat surprising consideration is that 
the U test is not truly distribution free. 
It is always sensitive to differences in 
distributions, and sometimes seems more 
affected by differences (other than mean 
differences) than is the test. 

If one final word is to be said it might 
be that one should not avoid using the 
t test (provided relevant considerations 
have been made) solely on the grounds 
that it is subject to error when assump- 
tions are violated and that the U test is 
not subject to error under the same con- 
ditions. Both of the statements are un- 
reasonable in view of the data. One 
should not, however, refrain from using 


the U test in place of the ¢ test on the 
grounds that it is considerabl 
erful than the ¢ test, 
not true. 


y less pow- 
This is simply 
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When quanta of light strike the 
retina, some of them.are absorbed by 
the pigment molecules contained in 
the visual receptor cells, and this ab- 
Sorption ultimately results in the acti- 
vation of the visual nervous system. 
Recent developments in the biochem- 
istry of the visual pigments has given 
us a fairly clear picture of the micro- 
structure of quantal absorption. 
theoretical analysis of this first stage 
in the visual process will be presented 
Rm analysis will be shown to 
eo a possible explanation of cer- 

ain striking changes in hue and 

brightness that occur when a stimulus 
of very high luminance is steadily 
Viewed. 


Turory or VisuAL PHOTO- 
CHEMISTRY 


i There is fairly good agreement that 
light triggers neural events by proc- 
esses of the following general type. 
Quanta of light, after passing through 
the other media of the eye, arrive at 


the retina where they may or may no 


S 1 This research was supported by United 
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ex his work on the brightness reversal 
Periment. 


hit receptor cells. Those quanta that 
finally arrive in the region of the 
visual pigment of the receptor cells 
may or may not be captured by the 
molecules of pigment. If a quantum 
is in the close vicinity of a particular 
pigment molecule, the probability * 
that it will actually be captured by 
the molecule depends upon the rela- 
tionship between the wave length, or 
energy, of the quantum and the physi- 
cal structure of the molecule and, its 
surroundings. If all these probabili- 
ties are satisfied, the quantum is ab- 
sorbed by a visual pigment molecule. 
When, before absorption of the quan- 
tum, the pigment molecule was in a 
state that will hereafter be called the 
"regenerated" state, the absorption 
will cause the molecule to change its 
state (to what will be called an ‘‘un- 
regenerated” state). For example, 
Kropf and Hubbard (1958) suggest 
that the change in state is a cis-trans 
isomerization. The changing of state 
of a regenerated pigment molecule 
sets up a process in the receptor which 
eventually results in the discharge of 
a packet of some chemical mediator at 
a synapse farther along in the retina. 
When the concentration of chemical 
mediator at the synapse is great 
enough, a nerve impulse is triggered, 
and either that impulse or another 
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resulting from it, travels up the optic 
nerve. At the synapse, there is also a 
substance that destroys the chemical 
mediator and this, combined with 
diffusion effects, give the mediator a 
short “half life.” Therefore the prob- 
ability of triggering a nerve impulse 
depends upon the rate of discharge of 
packets. Above threshold, the fre- 
quency of nerve impulses will be 
greater, the greater is the rate at 
which packets of mediator are being 
discharged. Since each discharged 
packet is the ultimate result of the 
absorption of a quantum by a regen- 
erated molecule, the frequency of 
nerve impulses will depend upon the 
rate at which quanta are absorbed by 
regenerated molecules. 

The changes that occur in retinal 
photopigment when light is absorbed 
are reversible, and, if the eye remains 
in darkness long enough, virtually all 
of the pigment will have regenerated 
to its initial, ready-to-trigger, state. 
Some of the more recent evidence on 
cone pigments iz vivo indicates that at 
least the main part of this regenera- 
tion process goes on at the same rate 
whether the eye is in darkness or in 
light, the rate depending only upon 
the concentration of unregenerated 
pigment present (Rushton, 1958). 

If the above general conceptions are 
correct, then, when a Subject is 
steadily fixating a field of fixed lumi- 
nance, retinal pigment is both being 
broken down and regenerated, and the 
frequency at which nerve impulses are 
being fired in any one Optic nerve 
fiber depends upon the rate at which 
regenerated molecules are being 
broken down. Hereafter, the term 
"output" will be used to denote the 

instantaneous concentration of chemi- 
cal mediator at the receptor terminus, 
This receptor output is thus directly 
related to the rate at which quanta are 
absorbed by regenerated molecules, 
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Proportion of Regenerated Pigment 


2 
5 
o 
Time 
Fic. 1. 


Proportion of regenerated pig- 
ment as a function of the time after onset of 
the stimulus, for two different illuminances 
(B greater than A); and output as a function 


of the time after onset of the stimulus for the 
same two illuminances, 


and it is similarly related to the fre- 
quency of triggering of nerve im- 
pulses somewhere farther along the 
retinal activity chain, 


of a single receptor 
£ considerations are 
€ceptor is completely 
l efore the onset of the 
Stimulus, so that the proportion of 
regenerated to total Pigment is 1.0. 
At the time represented at the origin, 
a steady light is turned on. The 
pigment concentration will drop rap- 


if the precedin 
correct. The r 


dark adapted bi 


CHANGES IN THE APPEARANCE OF STIMULI 


idly at first and then level off as it 
approaches some asymptotic level, 
the level at which the rates of break- 
down and regeneration are equal. 
TThe lower curve, labeled B, repre- 
sents the same events for an identical 
receptor under a more intense stimu- 
lus. 

The curves in the lower part of 
Figure 1 are a generalized representa- 
tion of the outputs (rates of pigment 
breakdown) of the same two receptors 
under the conditions of the upper 
part of Figure 1. Since the receptors 


gp 


Proportion of Regenerated Pigment 


Output 


Time 


Fic. 2, Proportion of regenerated pig- 
Et as a function of time after onset of the 
pesnulus, for two different illuminances (A at 
jnoderate illuminance and B approaching an 
or nite illuminance); and output asa function 
sa time after onset of the stimulus for the 

Mme two illuminances. 
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are taken to be identical, and since 
at time 0 both concentrations are 
1.0, the initial output of B will be 
greater than the initial output of A. 
Further, since the time constants of 
regeneration for A and B are taken 
to be equal, the output of B must ap- 
proach a final level that is higher than 
the asymptote of A. This is true 
even though the equilibrium concen- 
tration of regenerated pigment is 
lower for B than for A. 

The upper part of Figure 2 repre- 
sents the regenerated pigment con- 
centration curves for the same two 
receptors when the illuminance on B 
has been raised almost to infinity, but 
the illuminance on A is the same as in 
the preceding figure. In the first 
short interval after the light is turned 
on, virtually all of the pigment mole- 
cules in B will be broken down. 
From that time on, each molecule 
that flips back to the regenerated 
state will immediately be broken down 
again. In other words, when the 
illuminance approaches infinity, the 
concentration approaches equilibrium 
extremely rapidly, and the equilibrium 
level depends only upon the time 
constant of regeneration of the pig- 
ment. The lower part of Figure 2 
shows the output curves for these re- 
ceptors, again when the illuminance on 
B approaches infinity. It will again 
be true that, when the stimuli are 
first turned on, B will have the higher 
output. Further, at equilibrium, B 
must still have the higher output, 
even though its regenerated pigment 
concentration is very much lower. 
In general, the equilibrium rate of 
breakdown of regenerated pigment 
(output) is greatest when the illumi- 
nance is infinity, since that rate equals 
the rate of regeneration and since the 
rate of regeneration will be greatest 
when the concentration of wnregener- 
ated pigment is greatest. But, since 
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Receptor B will reach its equilibrium 
level almost instantaneously, its out- 
put curve must cross the output of A 
twice, as in the lower part of Figure 2. 
(This is necessarily true as long as the 
illuminance on A is great enough that 
its initial output level is greater than 
the equilibrium level for B.) Obvi- 
ously the illuminance on one of a pair 
of receptors need not be nearly in- 
finite in order that the pair exhibit a 
reversal of output relations. There 
will be a large set of pairs of illumi- 
nances for which reversal should 
occur. 


A More Sprciric SET or Assump- 
TIONS AND THEIR CONSEQUENCES 


Now let us substitute, for each of 
the general conditions described above, 
a corresponding specific assumption 
about the operation of the visual sys- 
tem. From these assumptions, cer- 
tain quantitative aspects of the visual 
Phenomena to be discussed may be 
derived, and the derivations examined 
in comparison with the data them- 
selves, 

If the absorption of 
particular molecule is 
the state of the other molecules in the 
receptor, then it follows Írom statis- 
tical considerations that the rate of 
absorption by Tegenerated molecules 


1S; 
(2). -a 


where N is the number of regenerated mole- 
cules, A is the probability that a quantum 
incident on a region containing a regenerated 
molecule will bleach that molecule (the photo- 
sensitivity of each regenerated molecule at 
the wavelength of the incident quantum), 


and Q is the number of quanta incident dur- 
ing the Time dt. 


a quantum by a 
independent of 


[1] 


Recent studies of the bleaching of 
cone pigments in the living human 


Tom N. ConxswEET 


eye support the notion that, at least 
to a good approximation, the regen- 
eration of cone pigments is a first 
order chemical reaction (Rushton, 
1958). We will assume, accordingly, 
that the probability that any particu- 
lar unregenerated molecule will be 
regenerated in a unit of time will be 
independent of the states of the sur- 
rounding pigment molecules. It fol- 
lows then that the rate of regeneration 
of unregenerated molecules is: 


(2) - K(M-N) [2 


where M is the total number of molecules, re- 
generated or not, in the Teceptor, N is, again, 
the number of regenerated molecules, and K 
is the Probability that an unregenerated 
molecule will be regenerated in Time dt, 


The net rate of 
ated molecules is: 


dN 
da ^ E(M — N) — 4QN [3] 


change of regener- 


We will now 
the proportion 
ina receptor: 


define the quantity X, 
of regenerated pigment 


X = N/M 


Substituting Equ 
tion 3 and dividing 


dX 
di^ KA- X) — 40X [5] 


[4] 


ation 4 into Equa- 
through by M: 


The Solution 


: to this differential 
equation yields: 


Lr - AQ 5 
K+4aQ + erag [6] 


when 


the initial condition is X = 1 
that i 


1S, complete dark adaptation. 
In the preceding section, the output 
of a given receptor was defined as the 
rate at which quanta are absorbeg by 
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regenerated molecules. Therefore: 


dN 
i= (55) =AQN=AQMX 


dt 
K 
g aom| e + 
xoc] [7] 


This is an equation that relates the 
Output of any given receptor to the 
irradiance and duration of a steady 
light falling upon it.) The lower parts 
of Figures 1 and 2 are actually plots 
of Equation 7 for various values of 


CONSEQUENCES FOR BRIGHTNESS 


_ It seems to be a reasonable assump- 
tion that the brightness of a lighted 
area will increase as the frequency of 
nerve impulses originating from the 
retinal region corresponding to that 
area increases. If this assumption 
and those concerning the conversion 
of quanta to impulses are correct, 
then the falling of the curves in the 
lower parts of Figures 1 and 2 repre- 
Sent the fact that the brightness of a 
stimulus of fixed luminance decreases 
as the eye light adapts to it. 

The considerations illustrated in 
the lower part of Figure 2 also lead to 
the following prediction. If the eye 
1$ first dark adapted, then one area of 
the retina is illuminated (say with 
White light), and another area is 
Simultaneously illuminated much 
More intensely, the more intensely 
illuminated area will look first brighter 


? None of the steps in this derivation is 
Original, Pieces of it can be found in the 
Writings of Troland (1930), Hecht (1935), 

ushton (1958), for example. And, al- 

ough the writer has not found the complete 
it vation all in one place in the literature, 

May well be there somewhere. 
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Fic. 3. Times of brightness reversals as a 
function of retinal illuminance for the stimu- 
lus configuration shown in the upper right 
corner. (The surround was dark, the back- 
ground illuminated, and the horizontal bar 
consisted of a strip of 0.4 log unit neutral 
density filter. Each point is a mean of four 
judgments.) 


than the other, then darker, and 
finally somewhat brighter again. 
That is, there should be a period of 
time during which the less intensely 
illuininated region should look brighter 
than the more intensely illuminated 
one. This paradoxical prediction has 
been tested and it is correct. The 
predicted brightness reversal does 
indeed occur, and is very easy to 
demonstrate. 

The data in Figure 3 were taken 
under the following conditions. The 
subject fixated a small bright point 
in the center of the field shown in the 
upper right corner of the figure. The 
field was shown in Maxwellian view, 
through a simple optical system whose 
effective aperture was $5. The 
source was a 32-candle power auto- 
mobile taillight bulb, run from a regu- 
lated DC supply. The filament 
image at the pupil was 1 millimeter 
in diameter, centered on the natural 
pupil. The luminance of the field 
was varied with neutral density filters. 
Heat glass was used to filter out the 


infrared. 
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When the field is first turned on, it 
looks like a dark bar on a bright field. 
But after a time that depends upon 
the luminance relations, the field ab- 
ruptly reverses, so that the bar looks 
considerably brighter than the back- 
ground. Then the brightnesses grad- 
ually converge and reverse again, 

until finally the background looks 
just slightly brighter than the bar. 
To collect the data in Figure 3, the 
following procedure was used on each 
trial The subject's viewing eye was 
dark adapted for 5 minutes. Then 
the stimulus figure was turned on and 
two clocks started. The subject 
looked steadily at the fixation point, 
and at the first instant when the bar 
and its background were equally 
bright (changing from normal to re- 
versed brightnesses) he pressed a key, 
stopping one of the clocks. When the 
brightnesses were again equal (chang- 
ing from reversed to normal) he 
pressed a second key, Stopping the 
second clock. The two times were 
recorded, and each point on the curves 
in Figure 3 is a mean of four such 
times. The area between the two 
curves in Figure 3 represents the 
times when the brightnesses are re- 
versed. At least 45 minutes elapsed 
between successive trials, and the 
order of illuminances was balanced. 
The second equality point is much 
harder to judge than the first one, 
and the judgments are less reliable. 
Part of the reduction in reliability is 
due to the fact that the changes are 
occurring very gradually, but, in 
addition, strong hue differences be- 
tween the bar and ground appear 
after the first 20 seconds Or $0; the 
background turns reddish and the 
bar less red. (There is not much 
point in trying to interpret these 
color differences physiologically, be. 
cause the wave length composition of 
the stimuli in this set of experiments 


` Opaque strip. 
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Fic. 4. Times of brightness reversals as a 
function of retinal illuminance. (For these 
data, the bar, as shown in Figure 3, was re- 


Placed by an opaque strip. Each point is 
the mean of four judgments.) 


was very complex, due to absorption 
by the heat glass, etc.) 


Occurs 
equally strongly whether the fixation 
point is in the bar or the background. 
It cannot be explained by inhomo- 


taken 
When the bar was -4 log units less 
Intense than the background. (The 


ar trans- 
The data 

r the strip 
of .4 density filter was replaced by an 
In this case, the time 
Ss strongly dependent 
inances (and the hue 


relations are le 
upon the illum 


: There is extensive evi- 
dence in the literature to indicate that 
the Stray light in this Sort of stimulus 
Situation is appreciable (see DeMott 

Oynton, 1958, for an extreme ex- 
ample) F urthermore, there is 4 good 
Possibility that the illuminances at 
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Approx. 5* 


aV ocu 
m i. 


p 5. Apparatus for a demonstration 
e rightness reversal. (The inset figure 
ows the subject's view.) 


es edges of the figure (where stray 
z t is greatest) are the critical ones. 
(This possibility is currently being 
ea) Thus, the retinal condition 

which an opaque bar is used in the 
apparatus is on a continuum with 
any in which a filter of transmittance 
&reater than zero is used. 


Ms easy to set up a demonstration of this 
Ehtness reversal, as illustrated in Figure 5. 
RUD of cardboard is placed across a round 
Tet in a larger board. This hole is just a 
Gea smaller than the face of the sort of photo- 
ha lamp that has a frosted face and a 
Es n reflector (e.g., GE ZPH/RFL2) A 
ds all hole in the middle of the strip will serve 
v a fixation point (it is essential to fixate 
*ry steadily for this demonstration). The 
n of the photoflood lamp should be viewed 
Enough the hole and strip, and through heat 
sorbing glass, from about 4 or 5 feet away. 


AN Extension OF THE THEORY TO 
CHANGES IN HUE 


When a subject steadily fixates a 
field of extremely high luminance in 
e long wave length region of the 
Spectrum, the hue of the field under- 
T very striking changes. Some of 
cn changes have been mentioned 
dependently by Auerbach and Wald 
955) and by Noddack and Jarzyk 


(1955), in the course of studies of 
other color phenomena. Cornsweet, 
Fowler, Rabedeau, Whalen, and Wil- 
liams (1958) briefly reported a study 
of the color changes themselves. Ac- 
cording to all of those articles, the 
nature of the color changes is as fol- 
lows. The subject looks steadily at a 
field that is illuminated with light 
from the long wave length end of the 
spectrum. When the luminance is 
low or moderate, the field will look red 
or orange. However, if the lumi- 
nance is extremely high, it will look 
orange at first, but then rapidly shift 
in hue through yellow to a vivid green. 
This pattern of changes is reported 
to occur for stimuli of several different 
wave lengths in the "red" end of the 
spectrum, and for monochromatic as 
well as broad band stimuli. Stimuli 
with wave lengths in the green region 
of the spectrum are reported to pro- 
duce extreme desaturation, but no 
clear hue changes are reported. Auer- 
bach and Wald (1955) also report that 
bright blues will change to red, but 
such a change was not observed in 
either of the other two studies cited. 
A review article by Cohen (1946) pro- 
vides a number of references to other 
studies of the effects of high luminance 
stimulation. 

The theoretical considerations dis- 
cussed in the preceding sections sug- 
gest an explanation for some of these 
changes in hue. The explanation is 
based upon the retinal mechanisms 
already mentioned, along with some 
simple and widely held assumptions 
about the physiological bases for color 
vision. 

First, we will assume that there are 
at least two sorts of cones that differ 
in the pigments they contain, the two 
pigments having different but over- 
lapping absorption spectra. The ab- 
sorption spectra for such a pair of 
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bed 
Proportion of Incident Quanta Absor! 


610 Mj. 
Wovelength 


Fic. 6. Generalized Spectral absorption 
curves for two hypothetical retinal pigments, 
(The wave length 610 mu, indicated on the 


abcissa, will be specifically referred to later 
in the text.) 


cone types are shown in Figure 6.3 
The two cone systems will be labeled 
R and G, as in Figure 6. Now con- 
Sider the situation in which a subject 
is shown a stimulus and is forced to 
call it either red or green, no other 
alternative being permitted. Let us 
assume that the probability of his 
giving a "red" judgment depends upon 
the ratio of the outputs of the R and 
G systems. In the simplest case, 
when the R output is greater, he will 
be more likely to call the stimulus red, 
when the G output is greater, he will 
call it green, and when they are equal, 
he will judge it green half the time 
and red the other half. ere is no 
theoretical necessity that the output 
ratio be 1.0 for 50% “red” judgments, 
but that particular condition will be 
assumed for much of the remainder 
of this paper, merely to make the dis. 
cussion easier to phrase. 

Figure 7 shows the results obtained 
when a subject is forced to judge a 
flash of monochromatic light as either 
red or green, as the wave length in the 


*The theory discussed in this paper re. 
quires that there be at least two cone systems, 
but does not approach the question of how 
many there may be altogether. 
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region between red and green was 
varied by the method of constant 
stimuli. It should be noted that the 
slope of this curve is very steep, so 
that a change of just one or two 
millimu on either side of the 50% 
wave length will elicit reliable judg- 
ments of red or of green. Now if a 
subject is given a stimulus with a wave 
length right at the 50% point and 
asked what its hue really is, he says 
it is pure yellow. It follows from 
these observations that the last as- 
sumption discussed 
lent to assuming the following: when 
the R output exceeds the G output, 
the subject will see red, when the G 
output is greater 
and when the tw 
see yellow. 


above is equiva- 


; he will see green, 
9 are equal, he will 
(This is of course true 
only for the Spectral region between 
Ted and green. There may be other 
Spectral regions in Which the R and 


G outputs are also equal, but where 
the hue is given some other name, 
e.g., blue. The question of whether 
100 > 
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the yellow sensation is produced by a 
mixture of the R and G outputs or 
represents an excess of yellow over blue 
output with the red and green out- 
puts cancelling each other, is not 
relevant to any of the present dis- 
cussion.) 

The explanation of some of the hue 
changes that occur to stimuli of high 
luminance follows directly from these 
assumptions. Monochromatic light 
in the long wave region of the spec- 
trum, for example at 610 millimu, will 
be absorbed by both the R and the G 
cone pigments, but since the absorp- 
tion coefficient for the R pigment is 
higher than the G at 610 millimu 
(see Figure 6), the R cones will ab- 
Sorb a greater percentage of the inci- 
dent quanta. This is exactly equiva- 
lent to putting a greater illuminance 
on the R than the G cones. There- 
fore, if the R and G cones differ only 
in their absorption spectra, and a 
Stimulus of very high luminance at 
610 millimu is given, their output 
versus time curves will look like those 
in the lower part of Figure 2, Curve A 
representing the G and B the R sys- 
tem. The stimulus should look red 
at first, then change through yellow 
(when the curves cross) to green, and 
finally slowly change back through 
yellow to orange. 

It has been reported in previous 
papers (Cornsweet et al., 1958) that a 
high-luminance red stimulus will ap- 
Pear to change from red through yel- 
Ow to green. "No further color 
Changes were observed, even though 

Xation was maintained for as long as 
three minutes" (p.898). The present 
More extensive data at a variety of 
Wave lengths, and at luminances 
greater than those used in the earlier 
Study, clearly show that further 
Changes do occur. In fact, the hue of 
the stimulus eventually changes back 

Tom green through yellow to orange 


as is predicted by the theoretical 
argument above. This later set of 
changes happens to take appreciably 
longer than 3 minutes at the relatively 
low luminances and at the particular 
wave lengths used in the earlier 
study (Cornsweet et al, 1958). But 
at other wave lengths and luminances, 
all of the changes appear to be over 
within a minute or two (see the data 
in succeeding sections). 


GENERAL ASPECTS OF Hur CHANGES 


The general course of the hue 
changes that occur when viewing à 
high-luminance stimulus at 620 milli- 
microns is illustrated in Figure 8. 
This figure is drawn from a set of 
crude binocular matchings, in which 
the high-luminance stimulus was pre- 
sented to one eye and a matching 
stimulus of much lower luminance, 
whose wave length could be varied, 
was presented to the other. The 
curve in this figure should not be 
taken as anything more than a quali- 
tative indication of the phenomenon 
under discussion. Quantitative data 
of a more reliable and useful sort will 
be presented later. 
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Fic. 8. Wave length of a low luminance 
stimulus flashed to the left eye that matches 
the hue of a high luminance stimulus, at 620 
mu, steadily fixated by the right eye, asa func- 
tion of the time of viewing the high-luminance 
stimulus. 
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At this writing, the limits of the 
range of wave lengths for which hue 
changes occur have not been well 
established. However, the following 
general statements are true. High- 
Iuminance stimuli with wave lengths 
in the "green-yellow" region of the 
spectrum (from about 560 to 580 
millimu) look green at first, turn a 
much deeper green, then back through 
yellow to orange, and finally back to 
yellow or greenish yellow. Any wave 
length between 580 and 655 mu 
will look orange at first, change 
through yellow to a good, saturated 
green, and finally back to an orangish 
yellow. It is probable that light of 
wave lengths longer than 655 mu also 
appears to turn green if its luminance 
is high enough, but 655 mu is the limit 
with the present apparatus. 

Stimuli below 560 mu have not been 
studied systematically, but in general 
they seem simply to desaturate to a 
white at the luminances available in 
the present apparatus. 

For stimuli in the region of about 
620 down to 560 mu or less, when the 
luminances are well above those nec- 
essary to produce the change to 
green, another change seems to be 
superimposed upon the first 15 or 20 
seconds of viewing. The field begins 
its normal course of hue change, but 
then abruptly turns to a desaturated 
pink or violet. It retains that ap- 
pearance for as long as 20 seconds, 
then changes back to orange, yellow, 
or green, and the normal sequence is 
resumed. This change to pink or 

violet has not yet been studied 
quantitatively. 

All of the changes in appearance 
that have been observed depend in 
character and timing upon the lumi- 
nance, and, of course, the wave 
length of the stimulus. These de- 
pendencies will be analyzed below. 


QUANTITATIVE ANALYSIS or Huer 
CHANGES 


The photochemical theory out- 
lined above requires that the time 
course of changes in hue be systemat- 
ically related to the luminances and 
wave lengths of the stimuli. In order 
to evaluate the theory quantatively, 
data have been collected on the time 
course of hue changes as a function of 
retinal illuminance for a number of 
wave lengths. Those data and their 
relationship with the theory will be 
presented below. 


PROCEDURE 


Judgments Used for the Collection of 
Quantitative Data 


The data Presented h 


E : ere were collected by 
using two different sor 


A ts of procedures, The 
data for Figure 8 (and the data for the bright- 


ness reversals discussed earlier) are based 
upon a matching technique, 


mension (hue in Fig 
for the brightness reversal data), 
procedure used here i 


the matching field is res 
One eye and the eee 


"^ 


— a a 
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effects of the stray light are not eliminated, 
but just rendered less estimable. The collec- 
tion of the data represented in Figure 8 took 
pente: of the fact that, even when the two 
ields are projected on to noncorresponding 
eee of the two retinas, rivalry occurs. That 
a ES stimulus of high luminance is pre- 
ri continuously and the matching field is 
ane ionally flashed to the other eye, the only 
pon visible during the flash seems to be the 
E field. It is then possible to find a 
the im h wave length that is the same hue as 
a igh-luminance field at a given time after 
RM of the high-luminance field. How- 
pr E procedure clearly involves so many 
anes o "unknown effect (stray light, rivalry, 
mes m. from the flash, etc.) that it is at 
p seful only to give a rough idea of the 
s of hue changes that do occur. 

No fees sort of matching technique 
ot dat nave been used in this study. Instead 
field emning the time at which the intense 
s ene one of a fixed hue, the subject 
GS NT been presented with two patches 
differs ifferent wave lengths and high but 
Both n. luminances. Then as the patches 
Bis dii n in hue (and brightness), the 
fines. en they match could have been deter- 
ete While not all pairs of stimuli would 
dim a match, there must be sets of them 
hee do so. This kind of matching 
Eee dure would not be so severely affected 
io ind light, and in many ways !$ preferable 
he color naming procedure. Brindley 


` (1960) has clearly pointed out that this type 


etching procedure (or at least one almost 
logi dps with it) enables one to draw physio- 
sith, conclusions from psychophysical data, 
ph out requiring what he calls "psycho- 
is anal linking hypotheses." However, it 
Neq true that, when such a procedure is 
ftot ı psychophysical linking hypotheses can- 
be tested, and one of the aims of this 
d ET is to examine just such a question, 
of mely, what are the physiological correlates 
the condition under which a subject re- 
re the hue as red half the time and green 
e other half. The matching technique 
apa does have many advantages over 
i or naming as long as psychophysical link- 
ng hypotheses are not in question, and studies 
USing strict matching procedures are now in 
Progress, 
ien the other hand, the procedure that con- 
th s of recording the instants in time when 
pd Subject calls the stimulus neither reddish 
d greenish yields very reproducible results, 
rel results which may be interpreted in a 
atively straightforward way- 
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Apparatus 


The apparatus used to collect all of the 
color data in the present study is shown sche- 
matically in Figure 9, The source, S, was a 
tungsten-coil-filament lamp (GE PH/18A/ 
T10P), run from a constant voltage trans- 
former at 6 volts, 18 amperes. Lens Li 
formed an image of the filament on the en- 
trance slit, En, of a Bausch and Lomb grating 
monochromator of 500 millimeter focal length, 
with a 600 lines per millimeter grating, blazed 
in the visible. The optics of the monochro- 
mator imaged the entrance slit (and the fila- 
ment) on the exit slit, Ex, with an equivalent 
aperture ratio of 44. An airspaced, coated, 
achromatic lens, Ls, focal length 150 milli- 
meters, f = 1.9, imaged the exit slit in the 
plane of the pupil of the subject's eye so that 
he saw a Maxwellian view of the back surface 
of the lens Le The entrance and exit slits 
of the monochromator were set to widths of 
1.5 millimeters, resulting in stimuli with 
spectral band widths of 5 milimu. The 


MONOCHROMATER ; 


Shutter 


Fic. 9. Apparatus for delivering very 
intense monochromatic stimuli. (See text 
for description of specific elements.) 
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length of the exit slit was also set to i» s 
meters. Therefore, since Lens Ls o ae 
magnification, the image at the pupi nas a 
square 1.5 millimeters on a side. pu e is 
small enough that as long as the exit slit is 
imaged at the center of the natural feu 
changes in the size of the natural pupil wi 
not produce appreciable changes in Fetinal 
illuminance. A neutral-density wedge, W, 
and filters, F, were used to adjust the lumi- 
nance of the stimuli. The wedge was close 
enough to the exit slit that no balancing 
wedge was needed. A clear piece of glass, G, 
reflected a small fraction of the stimulus light 


the total light at other wa 
than 5% of the stimulus luminance. 

The subject’s head was held in position by 
a dental-impression bar. At his hand was a 
control panel containing push buttons which 
operated timers, so that the timing of any 
changes in appearance could be recorded, 

When the stimulus was on, the subject sawa 
circular field, uniformly illuminated, as illus- 
trated in Figure 10, The dark circular line 
in this field was produced by cementing a 
circle of fine wire to the back surface of L,, 
This circle helped the subject to make his 


‘For the relatively short slit-length used 
here, the coil filament bulb delivered more 
energy through the System than did the rib- 
bon filament bulb that is ordinarily supplied 
with the monochromater, 
the coil filament, the energy 


predictable way. This variability was prob- 
ably due to the fact that only a very small 
piece of the filament image was used, and, 
since the image is not uniformly intense, 
slight amounts of sagging of the coil or its 
supports could result in noticeable changes in 
output at the exit slit, For this reason, the 
monochromater Output was continuously 
monitored, and the intensity Settings were 
made from the photomultiplier output rather 
than from the wedge or filter calibrations, 
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Fic. 10. Stin 
to the subject, 
ar area was unifo; 
chromatic light, 
to fixate the center 
the hue of the inner disk.) 


nulus pattern as it appeared 
(The entire 10-degree circu- 
rmly illuminated with mono- 
Phe subject was instructed 
of the pattern and judge 


judgments, in that he 
within the circle, i 
effects that small 


the outer margins of the field,5 


RESULTS op Hur CHANGE Srupy 
The data Plotted in Fig 
how the ti 


varies as a function of th 


lus at 610 millimu, 
an irradiance of 3x 


Second per square millimeter 
(uncorrected foi 


to indicate that it w. 
reddish nor &reenish) after 17 Seconds 


i The particular choice of a 
Stimulus disc is not crucial, 
Changes occur. With fields as 5 

utes of arc in diameter, 
still smaller sizes.) 
temporal aspects of the jud, 
when the fi 
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610 millimu (Figure 11). As the 
wave length increases, the lower 
curves shift toward the higher irradi- 
ances, but retain the same general 
shape. The upper curve at 630 
millimu shows evidence of a peak at 
the lower retinal irradiances. The 
upper curve at 640 millimu exhibits 
a very large amount of variability, 
and is not plotted beyond 9 X 10" 
quanta/sec/mm* because the change 
back to yellow had not occurred dur- 
ing 210 seconds of continuous viewing. 
The variability in all cases increases 
very greatly as the time of viewing 
o E. My D becomes extremely long. It will be 

kétinannirraaiunce (avonls pRGAEzORÓ shown that curves of this form may be 

per square millimeter of retina) x 10'? expected from the theoretical con- 

siderations discussed previously. 
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of a 11. Time to change hue as a function 
e retinal irradiance of the stimulus, at 


610 e 
thoes, (Each point is a mean of two read- 


hol 100 seconds after the stimulus 
ad first been turned on. The area 
no the two curves represents 
feria during which the stimulus Subject TNC 

D green. »--- 640 millimicrons 


The upper curve, for the second 


140 ——T —— 630 
x— 600 


QNM 74 — — a EE mie, 
Q4 ocu a Qc = pn cc X 
= t-n 
o ê 


3 
2 
S 
chan 8 
ae ge to yellow, extends out to a To | 
tinal irradiance of 21 X 10" quanta 2 
aa second per square millimeter, the © joo 
; uimum irradiance available in the Ei 
jBstatus. The lower curve is not PE 
plotted beyond 3.9 X 10" quanta/  & 
no nni because the hue changes in co 
ids region were obscured by a change 
x desaturated pink or violet, as indi- 40 
pe roughly on Figure 11. For 
this wave length, the stimulus never 2 Desaturation 
ieee yellow at irradiances below 
3 x 102 quanta/sec/mm*. " 
Figure 12 shows the same sort of g- she e a Ud 
at a Retinal Irradiance (Quanto per second 
| a for light of three other wave Fer square millimeter of retina) x 10" 
*ngths i d of 
nd of the s 
Spect s in the long wave s Fic. 12. Timeto change hue as a function 
rum. The curves for 600 of retinal irradiance for three different wave 


"illimu are very similar to those for lengths. (Each point is a single judgment.) 
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RELATIONSHIP BETWEEN THE DATA 
J AND THE THEORY 


The data in Figure 11 have been 
replotted as dots in Figure 13. They 
show how the time taken to turn 
yellow varies as a function of the ir- 
radiance of a monochromatic stimulus 
at 610 millimu. Every dot in this 
figure represents a time when the 
stimulus looked yellow. Therefore, 
according to the theoretical arguments 
outlined above, each of these points 
stands for a condition where the out- 
puts of the R and the G systems are 
in some particular ratio. That is: 


9, = CQ, [8] 


where C is the value of the ratio at 
which the hue is equally often called 
red and green in a forced choice. (In 
the preceding discussion C was as- 
sumed to be 1.0 for simplicity.) 
Substituting the photochemical 
Equation 7 for output in Equation 8, 


Time to Turn Yellow (Seconds) 


o 3 6 9 12 5 8B a 
Retinol Irradiance (Quanta per second 
per square millimeter of retina) x10 


Fic. 13. Time to change hue asa function 
of retinal irradiance, for light at 610 mu. 
Each point is a mean of two judgments. The 
( is a plot of the iterative solu- 


oth curve i I i 
dos to the implicit Equation 10, using the 


parametric values listed in the text.) 
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and supplying the appropriate sub- 
Scripts: 
A,QM, [ mae 
$ x2 : ette] 
— CA,0M, [ CIO 


A,Q 


«7 Cg AgQ) ) 
KypaQp a rre | [°] 


where ¢ is the time of occurrence of yellow. 
This equation may be rewritten: 


+ 


LEM. AQ gt CH AQ) 
K,+4,Q ` K,+4,0 
zl K,A, 
K,+4,0 
AO UKgtA | 
PES Ac) 
Kra i E 


where J = M,C/M,. 


A crude iterative procedure has 
been used to find the values for the 
five parameters of this equation that 
yield a good fit to the data. The 
two smooth curves drawn in Figure 
13 are a plot of Equation 10, when 
the following values of the param- 
eters are used: 


time constant of regeneration of red system, 
K, = .010 sec 

time constant of regeneration of green system, 
Kı = .0083 sec 

photosensitivity of red system at 610 mu, 
A, = 1.70 x 10715 
photosensitivity of green system at 610 mu, 
A, = 1.32 X 10-15 

ratio of amount of green to red pigment multi- 
plied by the output ratio for yellow judgment, 
J —1.18. 


These preliminary values may be 
compared with Rushton's (1958) esti- 
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mates: 
K, = .008 sec 


Ko = .008:sec+ 
4 = 255» 10™ 


for the cone system as a whole, based 
upon measurements of the bleaching 
of cone pigments by white light, and 
corrected for an ocular transmission 
loss of 50%. Assuming that only 
an R and a G pigment are present, 
and correcting for 50% transmission 
losses, the estimate of A for the entire 
cone system on the basis of the pres- 
ent data is: 


A,=1/.5A,9= 2[1—(1-A1) (0 —42] 
= 2(Ar +A) 
A= 6X 1075 cm? 


The value of A, determined from the 
color reversal data agrees much more 
closely with that quoted by Brindley 
(1960). From psychophysical data 
(quite different from those presented 
in the present paper), he estimates 
that A, = 1.61 X 10-5 (p. 217). 
The writer has been unable to find 
estimates of the constant “J” in the 
literature. (This constant represents 
a psychophysical linking hypothesis, 
in Brindley’s terminology-) 

The agreement between these esti- 
Mates is not perfect, nor is the fit in 
Figure 13. Its presentation is not 
intended to imply that the present 
formulation and these particular 
values are the "true" ones. The 
Dlot is meant to serve two purposes. 
First, it makes clear that the general 
formulation is consistent with the re- 
Ported hue changes when put into at 
least this specific form. Second, it 
Serves as a preliminary report on the 
extensive work now in progress at- 
tempting to evaluate, quantitatively, 
the specific theoretical arguments pre- 
Sented here. This work consists in 


making determinations of.the best fit 
values of the parameters of Equation 
10 under a number of independent 
conditions. For example, if this pro- 
cedure is repeated at a number of 
different wave lengths in the red 
region of the spectrum, the best fit 
values of Ka Ko and J should be 
identical at each wave length, but 
the values of A, and Ar should trace 
out the spectral sensitivity curves of 
the R and G systems. 

The quantitative procedure used 
for finding parameters of the color 
phenomena may be applied to the 
achromatic brightness reversal phe- 
nomenon as well. However, the par- 
ticular stimulus configuration used to 
gather the brightness reversal data 
presented in this paper do not lend 
themselves to straightforward analy- 
sis. For example, it is almost certain 
that the illuminances at the borders of 
a retinal region are more important in 
determining the apparent brightness 
of the region than are the illuminances 
in other regions ; and, for a bar against 
a background, stray light and small 
eye movements have large and in- 
determinate effects on the effective 
illuminances at the borders. An ex- 
periment is currently in progress in 
which these factors are minimized, 
while the time course of brightness 
reversal is determined in much the 
same way as before. The results of 
this experiment should be directly 
analyzable in terms of the photochem- 
ical hypothesis presented above. 

Observations on the rate at which 
the hue of the stimulus changes are 
qualitatively consistent with the the- 
ory, as plotted in the lower part of 
Figure 2. That is, the first change 
through yellow passes quickly, the 
stimulus actually appearing to be yel- 
low for a very short time. The second 
change, where the theoretically de- 
termined output curves are running 
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more nearly parallel, occurs very 
slowly and gradually. This is borne 
out quantitatively by the fact that 
the variance of the judgments of the 
time of second yellow point is appre- 
ciably greater than the variance of 
the first yellow point. The bright- 
nesses of the stimuli also agree quali- 
tatively with the theory. Although 
brightness has not been measured 
for the monochromatic stimuli, it is 
always high at first, gradually drop- 
ping to a much lower equilibrium 
level. 

From the theory as it has so far 
been stated, it might be expected 
that, just as a red light turns green, 
a green light should turn red, and a 
yellow one should not change in hue. 
This would indeed be the prediction 
if it happened to be true that (a) the 
rates of regeneration of the red and 
green systems were identical, and (b) 
the density of red and green pigment 
molecules were also identical. But if 
one regeneration rate were greater than 
the other, high-luminance stimuli near 
the yellow region would, at equilib- 
rium, take on the hue of the system 
that regenerates faster. From the 
present findings, this hue seems to be 
red, but the data are not yet complete 
enough to draw firm conclusions 
about the relative rates of regenera- 
tion. 

The fact that yellow and greenish 
yellow stimuli change toward purer 
greens before changing back through 
yellow would follow directly from the 
theory if there is more green than red 
pigment present in the stimulated 
region of the retina. Consider, for 

example, light of a wave length that 
looks yellow when it is first turned on. 
This means that it is initially isomer- 
izing red and green pigment molecules 
at the same rate (again assuming that 
C —1 for convenience). However, 
if there are actually fewer red mole- 


cules present per unit of area, the rate 
of isomerization of red molecules will 
immediately become smaller than the 
corresponding rate for the green sys- 
tem, and the stimulus should turn 
green. This sort of effect is superim- 
posed upon the processes already dis- 
cussed so that, at the proper irradi- 
ances, an initially green stimulus 
should first get greener, then change 
back through yellow to red, and 
finally again through yellow to green, 
as it does. 

It was mentioned above that, when 
the wave length is not too long (in 
the "orange") and the irradiance is 
extremely high, the early stages of 
hue change seem to be obscured bya 
wave of desaturated pink or violet. 
This effect would be expected if 
Short wave sensitive Systems were 
present and did produce desaturation. 
In this case, during the early stages 
of adaptation, both the R and the G 
Systems are so intensely bombarded, 
relative to the short wave sensitive 
systems, that the R and G outputs 
drop to values comparable with the 
short wave systems, and desaturation 
Occurs. 

Note: One should exercise caution 
when viewing stimuli of very high 
luminance. Fechner permanently 
damaged his retinae by staring at the 
sun, and ophthalmologists can count 
on an increased patient load after 
solar eclipses. Since there have been 
very few systematic studies of the 
Pathological effects of intense lights 
(see Whiteside, 1960), there are no 
clear rules on what should be avoided. 
In general, the sun should not be used 
as a source unless it is very strongly 
filtered, and any source that contains 
a large amount of infrared radiation 
should be filtered by heat-absorbing 
glass. The writer and many others 
who have observed the phenomena 
discussed in this paper exercised these 
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precautions and experienced no dis- 
comfort except for afterimages that 
last for a few minutes. 


SUMMARY 


Evidence concerning the absorption 
of light quanta by visual pigment 
molecules is used to derive a theoret- 
ical relationship between the param- 
eters of light incident on the retina 
and the output of the receptor cells. 
This relationship leads to the follow- 
ing prediction : when two retinal areas 
are illuminated, one very intensely 
and the other still more intensely, 
there will be a period of time during 
Which the less intensely illuminated 
region will appear brighter than the 
more intensely illuminated one. 
Data verifying this prediction are 
Presented. 

When the theoretical treatment is 
expanded to include certain assump- 
tions about retinal color mechanisms, 
It provides an explanation of some of 
the changes in hue that occur when a 
very intense colored stimulus is viewed 
steadily. Data on the temporal 
Course of these hue changes are pre- 
Sented, and their relationship to the 
theory is discussed. This kind of 
analysis is shown to be a possible 
means for the quantitative evaluation 
of certain physiological properties 
9f the visual system. 
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PRACTICE AS A PROCESS OF SIMPLIFICATION 


MARSHALL B. JONES! 


United States Naval School of Aviation Medicine, Pensacola, Florida 


From time to time in the last 30 
years a psychologist has intercorre- 
lated successive trials in the practice 
of some task or other. And in every 
instance the correlations have ar- 
ranged themselves in a very definite 
pattern. 

The correlations which appear in 
Table 1 are a good example. The 
task here is the Two-Hand Coordina- 
tion Test, which requires the subject 
to move a pin around a clover shaped 


TABLE 1 


INTERTRIAL CORRELATIONS FOR THE 
'Two-HaND COORDINATION TEST 


Trial] X1 | Xs | Xs | X« | Xs | Xe | Xs | Xs 


Xi Sr td rc |i 13 |11 | n 
X: .87 |.87 | .84 | .82 | .82 |.82 
Xs .91 | .89 |.87 | .85 |.86 
Xa. .91 | .88 | .86 |.88 
Xs .89 | .90 | .90 
Xo 93 | .93 
X; 94 
Xs 


runway by the coordinated turning of 
two control handles. The subjects 
are 152 basic airmen; and the trials 
are 60 seconds long, with 30-second 
rest intervals between trials.? 

But notice the pattern. For each 
trial the correlations are highest in 
the superdiagonal (just above the 
main diagonal) and decrease mono- 
‘tonically as we move either upwards 


1 Opinions or conclusions contained in this 
paper are those of the author. They are not 
to be construed as necessarily reflecting the 
view or the endorsement of the Navy De- 

artment. b 
p 2 For the use of these data I am indebted 


to E. A. Bilodeau. 


or to the right, until we come to the 
smallest correlation, which is in the 
upper right-hand corner of the matrix. 
There are a few inversions in the last 
two columns but they are minor, 
amounting in no case to more than .02. 

This pattern, this “superdiagonal 
form" as I will call it, can be seen in 
every practice matrix which has ever 
been published. It seems to be a 
universal property of practice that à 
trial correlates strongest with its 
neighbors and weakest with the first 
and last trials in the series (Edgerton 
& Valentine, 1935; Fleishman, 1953, 
1960; Fleishman & Hempel, 1954, 
1955; Fleishman & Parker, 1959; 
Greene, 1943; Houston, 1950; Melton, 
1947, p. 1022; Perl, 1934; Reynolds, 
1952a, 1952b; Viteles, 1933). 


PRINCIPAL POSSIBILITIES 


Now what are we to make of this 
fact? Structurally, there are three 
principal possibilities. The first, in 
an historical sense, is the Woodrow 
two-factor structure. 

In 1939 Herbert Woodrow observed 
that every practice matrix which was 
known at that time could be explained 
in terms of two common factors with 
positive weights which increased on 
one factor and decreased on the other 
as one went through the practice 


series. In other words, he posited 
that: 


X; = a,F, + bi F2 + aS 
Xo = asFi + baF2 + 6282 
Xs = ass Ph bsFe + ¢3S3 


X, = anFi + b4Fs + CnSn 
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where the X; were the trial scores, Fi 
and F, were the two common factors, 
the S; were specific factors, and load- 
ings, a; and b;, obeyed these laws: 


Oca Xa E-X€asaS as 
and 
b> be> bse > bn 0 


We now know that the Woodrow 
Structure cannot explain all practice 
matrices. Sometimes, particularly in 
long matrices, a third Woodrow factor 
is required, or even a fourth. Never- 
theless, the great bulk of practice 
matrices can be explained in terms of 
two Woodrow factors, and the present 
matrix is a case in point. If we 
hypothesize that: 


DUE GL y= 92 $1.57 be = .24 
O=.75 ap=.94 bo =.57 be = 19 
a= .83 a,=.94 bs=.47 br =-19 
4.87 og=.94 be =.39 bs = 19 


we get the correlations which appear 
E Table 2. Clearly, the fit is good. 
©, as far as the correlations are con- 
cerned, the Woodrow structure is a 
tenable hypothesis. 
n second principal possibility is 
T double accumulation structure." 
Or an eight-variable matrix the 
equations are: 


Xi fi foc-fasl-fic-fs-fe th +51 
Meefththetitletle th fte 
Xifefefebfiehifs ftf 
Xi fifi fi-fetls +futfethistss 
Xs fifi fifi -Efi- fu fiit fiis 
Xe He heh kf 
Xf  beblethotfathethetss 
Xin, fife fofi fii fats 
Where the trial scores are in standard 
Dn and all factor scores are in devia- 
^ form only? ln some respects 


3 
thet the double accumulation structure and 
simplex the right-hand side of each struc- 


TABLE 2 


‘THEORETICAL CORRELATIONS FOR THE Two- 
Hanp COORDINATION TEST UNDER A 
Wooprow HYPOTHESIS 


xi 48 | .77 |.75| .70 |.68 68 |.68 
Xs 89 |.87 |.83 |.81 | -81 81 
Xs 91 | .88 |.87 | .87 |.87 
Xs .89 |.89 | .89 |.89 
Xs .91 | .91 |.91 
Xs .92 |.92 
X: :92 
Xs 


double accumulation resembles Wood- 
row structure. Both hypotheses call 
for a two-process interpretation of 
practice, in which one process gains 
in importance with practice and the 
otherloses. They differ, however, in 
the way these two processes are 
conceived. In the Woodrow struc- 
ture each process is represented by a 
single factor with weights which de- 
crease or increase as practice proceeds. 
In the double accumulation structure 
each process is represented by a set of 
factors in which the included factors 
accumulate or drop out with each new 
trial. 

The double accumulation structure 
isa special case of the “open contiguity 
structure,” and there are some prac- 
tice matrices which require the more 
general form. In the present example, 
however, if we hypothesize that 
ej? = .70 and that: 


tural equation should be multiplied by an 
arbitrary constant. Otherwise, the structure 
conditions the variances and covariances as 
well as the correlations. Ordinarily, these 
constants are dropped as their inclusion would 
only complicate the equations to no particular 
advantage. Whenever we deal with vari- 
ances or covariances, however, these con- 
stants must be reinstated. 
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TABLE 3 
THEORETICAL CORRELATIONS FOR THE Two- 
HAND COORDINATION TEST UNDER A 
DOUBLE-ACCUMULATION HYPOTHESIS 


Trial] Xi | Xe | Xa | Xa | Xs | Xe | Xs | Xs 


Xi 49 1.77 |.75 | .73 | .70 | .70 | .70 
Xo .89 | .87 | .85 | .82 | .82 | .82 
X; 91 | .89 |.86 | .86 |.86 
5 91 |.88 | .88 |.88 
Xs 90 | .90 | .90 
Xe 93 |.93 
Xi 94 
Xs 


the double accumulation structure 
generates the correlations which ap- 
pear in Table 3. In this instance, 
therefore, as in most practice matri- 
ces, the correlations are easily ex- 
plained in terms of a double accumu- 
lation hypothesis. 

The third principal possibility, the 
simplex, differs fundamentally from 
the first two in that it calls for a one- 
process interpretation of practice. 
Practice, under a simplicial interpre- 
tation, is a process of simplification, 
exclusively. For an eight-variable 
matrix the equations are: 


Xi —febfebf-fefstfetftfckts 


dup ftfifeofit fit f-fil se 
X3 Fit fit- fit fe fifilss 
XT frtfets 
Xg— fs+ss 


where all variables are in deviation 
form only.* 


4 The requirement in the simplex that all 
variables be in deviation form only is im- 
portant. Were we to require that the trial 
scores be expressed in standard form or, 
which is the same thing, that the e" scores 
all have the same ard jm aie 

itrary constants), then e 
dd ies a single eee 
—which is a perfectly valid structure 7 
ei. place ^i the context of practice sm) 
it does not generate a superdiagona c 
For more on the single accumulation 
ture, see Jones (1960, pp. 50-51). 
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To generate specific correlations 
from a simplex we must hypothesize 
two kinds of parameters: the com- 
munalities 


and the complexities 


n (IY 


In the present instance, if we set: 


h?=.79  h$—.91 6-100 cs=.86 
h$-.87  h$-.93 c= 93 £57.83 
h$—.91  A?—.94 c= 90 6.83 
h?=91 he=.94 c= .88 c= .83 


we get the correlations which appear 
in Table 4 and a good fit. This par- 
ticular example, moreover, is not at 
all exceptional. Every known matrix 
of correlations between trials of prac- 
tice can be fit simplicially. 

I should also point out that, right 
now, there does not exist any way of 
testing the significance of these fits. 
However, no one of these hypotheses 
leaves a residual correlation greater 
than .02, and we hardly need a test to 
tell us that residuals of this magnitude 
are not significant. Even if they were 
significant, the fact would be of no 
particular moment. True, the hy- 
potheses would have to be modified; 


TABLE 4 


THEORETICAL CORRELATIONS FOR THE Two- 
HAND COORDINATION TEST UNDER A 
SIMPLICIAL HYPOTHESIS 


Trial Xi | X: | Xa | Xé| Xs | Xe | Xs | Xs 
Xi AA LITE orsi] 78) TL |i. | 72 
Xo -86 |.85 | .83 |.81 | .81 | .81 
X; -89 | .87 |.86| .86 |.86 
Xi .89 |.87 | .87 |.87 
Xs 89 | .89 |.89 
Xe .93 |.93 
X: 94 
Xs 


BM 
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but the modifications would be so 
paltry in amount that nothing of any 
real consequence would be altered. 


MOLAR CORRELATIONAL ANALYSIS 


To the learning theorist the next 
question is obvious: Which of these 
three structures best expresses the 
true nature of practice? I intend to 
answer this question, but first I want 
to say a word or two about correla- 
tional analysis. 

As i the last 30 years the treatment 
fe tation matrices in psychology 
E een completely dominated by 
actor analysis. There has been criti- 
cism, of course. In fact, there are a 
ay many psychologists, perhaps 

n a majority, who won't have any 
Dart of factor analysis. But for two 
re now there has been no 
o ce to factor analysis. If a 
d calculated a correlation matrix, 

Ne only existing way of summarizing 
a nt in structural terms has been 
E actor analytic way. In the last 
cl years, however, this situation has 

hanged. Beginning with Louis 

uttman’s work on the simplex in 
he and continuing with my two 
nographs on correlational analysis 
jones, 1959, 1960), an entirely new 
iscipline has been created, a com- 
Pistely different way of treating a 
Orrelation matrix. 
_ This new discipline, molar correla- 
tonal analysis, is coextensive with 
Ed analysis in practice and ab- 
à utely incompatible with it in the- 
dr The essence of this incompati- 
lity isa matter of approach. Factor 
analysis is an inductive method. The 
Se analyst starts with the correla- 
ee and then attempts by a perfectly 
Ott hanical computing regime to grind 
AE his factor structure. He knows 
S. well, of course, that there are a 
at many, substantially different 


structures any one of which could 
generate the correlations. But the 
factor analyst has faith. Aside from 
their intercorrelations he may know 
nothing whatever about the variables 
which enter into the matrix. Never- 
theless, he remains quite confident 
that by estimating communalities, 
factoring the matrix, and rotating 
axes (according to his preferred 
method) he will arrive at the structure 
which is actually responsible for the 
correlations being what they are. 

The molar analyst is a much less 
believing creature. As he sees it, 
correlational analysis is a strictly 
deductive discipline. If we are given 
some hypothesis as to how the vari- 
ables are structured, then algebra 
alone can tell us what the correlations 
ought to be—but that’s all it can do. 
The construction of theories is a busi- 
ness for the psychologist, as psycholo- 
gist, not the statistician. All the 
statistician can do is to give us the 
means for tracing out the conse- 
quences of our thinking. He can tell 
us which patterns follow from which 
structures. He can catalogue the 
more important of these structures 
and these patterns. He can even 
show us how to fit a structure to a 
correlation matrix. But beyond this 
he simply cannot go. 

In itself, this difference would en- 
able us to construct an entirely differ- 
ent approach to the treatment of 
correlation matrices. But there are 
several other respects in which molar 
analysis differs fundamentally from 
the factor analytic tradition. The 
most important of these is the em- 
phasis which the molar analyst places 
upon correlation pattern. A correla- 
tion matrix is more than a collection of 
correlations. As often as not, these 
correlations, Or à substantial number 
of them, display a definite pattern. 
We have already seen the superdi- 
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SIMPLICIAL ARGUMENT 


Let's return now to the problem at 
hand, the superdiagonal forms of prac- 
tice and their structural interpreta- 
tion. At the present stage of the 
game the arguments for and against 
our three main hypotheses are far 
from decisive. In my opinion, the 
simplex has the best of it on the avail- 
able evidence, but we have a long way 
to go before any one of these hypoth- 
eses or, for that matter, any other 
explanation of the correlations can be 
regarded as definitely established. 

The case for the simplex rests on 
two main arguments. The first, and 
most weighty, comes from experimen- 
tal psychology. Since the days of 

Bekhterev, psychologists have known 

that learning involved the progressive 

focussing of behavior on the one re- 

Sponse which was successful in secur- 

ing reward. At first the dog starts 

wildly in the stock. He makes all 
sorts of inappropriate responses; in- 
deed, initially, it is only by accident 
that he lifts his paw. As practice 
continues, however, his behavior grad- 
ually focalizes, until in the end the 
dog barely lifts his Paw the amount 
required to avoid the shock. As we 
observe it, in other words, practice js 
a process of simplification. How 
very natural therefore, that in its 
differential composition (which we do 
not observe) practice should also be- 
come simpler and simpler as one trial 
follows another. 

In making this argument I want to 
be quite clear that I am talking about 
practice, not learning. There are 
some forms of learning, problem Solv- 
ing for example, which almost cer- 
tainly are not processes of simplifica- 
tion. If Krechevsky (1932) had in. 
tercorrelated successive trials in his 
famous studies of hypothesis" forma- 
tion in rats, it is quite possible that 
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the correlations would have shown 
the superdiagonal form. But in this 
case the originating structure could 
hardly have been a simplex. The 
rats' "hypotheses," after all, came and 
went. First they tried one thing and 
then another, first a spatial hypothe- 
Sis and then a visual one. This is the 
kind of behavior which generates an 
open contiguum, a structure in which 
factors can come as well as go, not a 
Simplex, in which they can only go 
(or come).7 By "practice" I mean 
that portion of the learning sequence 
Which follows the formation of the 
correct hypothesis, Learning doesn't 
end just because the person or animal 
has finally understood the problem. 

ere remains the “stamping-in,” the 
mastery of the problem, And it is 
this stamping-in, this practice, which 
the simplicial hypothesis requires to 

€ à process of simplification. 

he second argument has to do with 
the external relations of early and late 
Stages of practice, According to the 
simplicial hypothesis, the early stages 
of practice contain a variety of factors 
in addition to those which are most 
Particularly Tequired in the practice 
task itself. This hypothesis implies 
that, if we correlate the trials of prac- 
tice with materials which are categori- 
cally distinct from the practice task, 
then, on the average, the early stages 
of practice Should be more strongly 
related to the test materials than the 
later ones, On the other hand, if we 
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correlate the trials of practice with 
very similar materials, we should ex- 
pect just the reverse result: in general, 
the later stages should be more 
strongly related to the test materials 
than the early ones. 

Unfortunately, the only evidence we 
have which is relevant to these two 
consequences concerns psychomotor 
tasks. In this one area, however, the 
evidence is very clear-cut. When 
they used printed tests as the external 
measures, Adams (1953, 1957), Fleish- 
man (1960), Fleishman and Hempel 
(1954, 1955), and Reynolds (1952a, 
1952b) all found that the predictabil- 
ity of the trials on a psychomotor 
task generally decreased with prac- 
tice. On the other hand, these same 
Investigators have also shown that the 
relationship between one psychomotor 
task and another usually increases 
With practice, on either one of the two 
tasks or both. And if we take an area 
more or less in between, say training 
criteria, we find a more or less neutral 
result: for example, the correlation be- 
tween the Air Force pass-fail pilot 
criterion and scores on the Complex 
Coordination Test neither increases or 
decreases with practice. Unless we 
make special assumptions, however, 
None of these results follows from 
either the Woodrow or the double 
accumulation structure. All told, 
therefore, the existing evidence on the 
relationship between trials of practice 
and external measures strongly sup- 
Ports the simplicial hypothesis. 


IDENTIFYING FACTORS 


In all branches of study the rec- 
Ognition of form is followed by the 
analysis into parts. Once the astron- 
Omer has recognized a star he wants 
a know what it's made of. Once 
Ets biochemist has isolated a sub- 

nce he wants to know it's composi- 
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tion. And once the psychologist has 
decided, even tentatively, that prac- 
tice is a process of simplification he 
wants to know what the elements are 
which participate in this process; he 
wants to identify the simplicial factors. 

Now, the only way the psychologist 
can identify these f; is to administer a 
variety of other tests to the same sub- 
jects who worked on the practice task. 
Once this is done, he can correlate 
the reference tests with the stages of 
practice and, through these correla- 
tions, obtain the correlations between 
the reference tests and the simplicial 
factors. If these final correlations 
are high, then the factors are identi- 
fied. If not, we must try again, with 
other tests. 

To illustrate this logic let us rework 
the results of a recent study by E. A. 
Fleishman.’ In Table 5 I have repro- 
duced the intertrial correlations on 


$In his analysis of these data Fleishman 
threw both the trials of practice and the refer- 
ence tests into one grand matrix and per- 
formed a factor analysis. There is, I think, 
a fatal objection to this procedure, quite 
apart from Fleishman's use of factor analysis. 
]t is that the structure of the practice task 
becomes dependent upon the test materials. 
If we use one battery of tests, we will get one 
structure for the trials of practice; if we use 
another, we will get a second; if we use a third 
battery, we will get still another. But it's 
obvious, it seems to me, that the trials of 
practice on, say, the Rotary Pursuit Test 
have some structure, altogether apart from 
the tests we happen to administer. This 
structure doesn’t have to be a simplex. Asa 
factor analyst, Fleishman might very well pre- 
fer the Woodrow structure. But however 
we interpret them, it seems quite clear that 
there is only one true structure for a single 
set of variables. Under Fleishman’s ap- 
proach the factors we are trying to identify 
are a function of the materials with which we 
are trying to identify them. The only way to 
resolve this dilemma is the molar way, to 
analyze the intertrial correlations first, apart 
from the reference tests, and then, having 
arrived at a structural recognition of the 
factors, to attempt an indentification in 


substance. 
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TABLE 5 


INTERTRIAL CORRELATIONS FOR THE ROTARY 
Pursuit TEST 


Trials | 1 3 5 7 9 11 13 15 


^ 88 |.84| .77 | .74 | .70 | .64 
5 90 1.86 |.83 | .79 | .74 
7 .89 | .86] .83 |.80 
9 -88 | .87 | .82 
11 .88 | .84 
13 85 


^ Fleishman reported his results for the odd-num- 
bered trials only. 


the practice task, which was the 
Rotary Pursuit Test. This matrix 
has the superdiagonal form. And if 
we hypothesize a simplex and set: 


hY=.86 he=.96 (0,2100 cs=.73 
h$-.92 he=.94 c= .89 02.70 
he=.94  h?—.90  c— .82  c—.69 
h?=.98 hse=.82 c= .78  cs=.67 
we get the theoretical correlations 


which appear in Table 6. Since the 
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TABLE 6 


THEORETICAL CORRELATIONS FOR THE 
Rotary Pursuit TEST UNDER A 
SIMPLICIAL HYPOTHESIS 


7] 9 |0 | HS 


1 -79 | .74 | .72 | .66 | .63 | .61 |.56 
3 86 |.83 1.77 |.73 | .71 |.65 
S .91|.85 |.80 | .77 | .72 
7 91 |.86 | .83 | .77 
9 911.89 |.82 
11 91 [.84 
13 .84 


15 


fit is good, our hypothesis is tenable. 
It remains to find out what the hy- 
pothesized factors are, not structurally 
(we already know that) but substanti- 
ally. 

In addition to the Rotary Pursuit 
Test Fleishman administered a battery 
of 17 reference tests, 7 paper-and-pencil 
tests, and 10 apparatus tests? Now, 


° The reader will notice that for the printed 
tests, Tests 1 through 7, the early trials are 
more strongly related than the late ones, 


TABLE 7 
CORRELATIONS BETWEEN THE REFERENCE TESTS AND THE TRIALS OF PRACTICE 


Trials 
Test 

: 3 5 7 9 11 13 15 
General mechanics 49 22 22 18 20 9 15 12 
Tool functions 19 22 18 46 117 14 A3 07 
Speed of identification . .20 .24 26 .26 .29 36 27 26 
Instrument comprehension | .20 22 22 20 21 25 20 23 
Visual pursuit .30 33 25 23 23 24 22 21 
Aiming 27 | .32 | 34 | 29 | 32 | .30 | 227 29 
Marking accuracy -26 33 33 31 34 .35 38 34 
Purdue assembly board oy -21 18 16 Aed 45 17 AT 
Santa Ana dexterity 28 ES 39 38 .32 .36 37 34 
Rate control 25 r^ 19 21 18 A7 12 d 
Linear pursuitmeter 23 55 8 2 2 x de um 
da rs 27 25 29 27 .29 26 29 29 
pore SET 24 | 324 | 26 | pa || 222 le oA | 2S 
Reaction Time 34 34 Eo 3 wi EA pt ae 
Track tracing v. 47 336 ad We Ea E: e eo 
Complex coordination 1 25 E Es s Ds M EA 
Reaction Time II E . E 
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TABLE 8 
CORRELATIONS BETWEEN THE REFERENCE TESTS AND THE SIMPLICIAL’ FACTORS 
Factors 
Test 
fi f: fs fe ts fe f fs 

General mechanics o2| —os| | .06| —.11} 0| 1| 2 
Teol functions =07| 08| =10] <03 | —04| —.21) 1| 2i 
a of identification .05 .08 00 42 .00 AL 321 22 
Vis pment comprehension 44 | —.11 20 .06 | —.07 .00 11 | 22 
inis pursuit 07 | —.03 07 06 | —.04 | —.42 AL) 33 
Markt 14 | —.03 03 42 | —.15 A1 sal .30 
Burdi ng accuracy .02 19 A3 1.2 | —.04 .00 32 229 
Eus nesembly b .02 .05 .00 06 | —.07 | —.16 .05 22 
Di a na dexterity .02 44 33 | —.12 .00 46 37 | «31 
Then, ntrol ^ 49 | —.05 .00 | —.06 .11 | —.32 .00 27 
Rate pe uenuimeten .05 .05 | —.07 .06 da | —.21 | —05| .25 
Rotar movement .28 .03 | —.03 .06 | —.04 05 tt 21 
Y aiming .09 ai .00 .09 | —.04 .21 | —.16 | -30 
TRE Time I 26 03 | —.07 | —.03 | —.04 SEI .00 | .26 
Can ‘leah E 12 .05 07 .00 .00 .05 .00 | .37 
R plex coordination 21 .22 .00 | —.03 .07 | —.11 | —.11 ES 

eaction Time II d6| 219 13| —06| —07 | -16 | 42 23 


eee assume that the test-trial cor- 
re em, which appear in Table l; 
Si mediated by the simplicial factors 
EU we can calculate the correlations 
eg the tests and the simplicial 
83) op odes, 1960, pp. 28-31, 80- 
e hese last correlations, of course, 
Brot necessarily exact, because the 
Sc pres under which they were 
ated could be in error. How- 
ie when the simplicial communali- 
are as high as they are for the 
a Pursuit Test, ranging from 86 
at )8—and the test-trial correlations 
MT as low as they are in this instance 
oye tun little risk of serious error in 
si epting the hypothesis of exclusive 
Mplicial mediation. And under this 
tp pothesis, the correlations between 
€ reference tests and the simplicial 
actors take on the values which ap- 
Pear in Table 8. : 


tuntteas for the apparatus tests, Tests 8 

oe 17, just the reverse is true. In this 

are i these tendencies are weaker than they 

So qi any other published study, but even 
Sy are still there. 


These values, unhappily, are low. 
Not one of them ranges beyond the 
40s and only three are that high. In 
this particular instance, therefore, our 
effort to identify the simplicial factors 
was a failure. Nevertheless, there is 
nothing the matter with the method; 
if we persist with it, then sooner or 
later we will find tests which correlate 
strongly with the simplicial factors. 
When this happens, we will have 
identified the fi. 


CONCLUSION 


In the end, of course, hypothesizing 
structures and identifying their fac- 
torial elements will not suffice. If we 
really want to understand the super- 
diagonal forms of practice, there is 
nothing for it but to experiment with 
them. How would these patterns 
change as à function of distribution of 
practice, Or of intermittent reward ? 
How would they be affected by prior 
practice on à related task? Sooner or 
later we must augment the passive 
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arts of correlational analysis with 
active experimentation. We must 
theorize, of course; and this theorizing 


wil inevitably be correlational in 
character. But we must also ex- 
periment. 
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SUPPLEMENTARY NOTE 


In this note I propose to take up the 
fitting procedures for the three structures 
which were treated in the text of this 
paper, the Woodrow two-factor struc- 
ture, the double accumulation structure, 
and the simplex. As a necessary pre- 
liminary, I will also treat a few, more 
elementary forms. 

Mathematically, the best way to fit 
any molar structure is by the method of 
least squares. Given a structural hy- 
pothesis, we know that the theoretical 
correlations, pij, are some function of the 
structural parameters, i Puts Pre 
We could, therefore, proceed to minimize 


the squares of the differences between the 
theoretical and the empirical correlations, 
i.e., between the p;; and the ri. To do 
this we need only differentiate the ex- 
pression : 


LX (ps5 ria) 


and solve the normal equations which 
result. 

There is, however, a difficulty with 
this approach: The number of normal 
equations which must be solved in order 
to determine a best fit is prohibitive. If 
we use least squares, every fitting process 


PRACTICE AS A Process OF SIMPLIFICATION 


requires us to solve at least as many 
Simultaneous equations as there are 
variables in the matrix. And this, in its 
turn, requires computing equipment of a 
high order. It is desirable, therefore, to 
find Some other, less heroic means of 
arriving at essentially the same results. 

n all the fitting procedures which I 
will describe the general process is the 
same. We begin by positing a tentative 
set of parameters, which we then adjust 
until we get what appears to be a best 
fit. These tentative values, of course, 
are not taken at random. Knowing 
what the structure is, we can determine 
tentative values which will require a 
minimum of adjustment. The only 
problem is to develop sound procedures 
or arriving at these tentative values. 


SPEARMAN UNIT FACTOR 


Ee us begin with the Spearman unit 
= oes Le., with the hypothesis that the 
M m which enter into the matrix 
‘ain a single common factor. If the 
2s is arranged so that the variable 
em the biggest loading. on this one 
mr ames first, the variable with the 
Hos iggest loading second, and so on, 
> £s to the variable with the smallest 
ading, which comes last, the structure 
8enerates a characteristic pattern. This 
Pattern, which I call a ‘monotonic 
perch ya locates the biggest correlation 
if the upper left-hand corner of the 
enr and the smallest in the lower 
eu t-hand corner with the others spread 
Nr in between. If the loadings are ar- 
t ged the other way around, i.e., with 
© smallest loading first and the biggest 
3st, then the pattern will be reversed: 
€ biggest correlation will be in the 


TABLE A1 
A Monoronic HIERARCHY 


Xi X: xi xa 
ra i 
63 64 | 66 
x ‘70 72 
X 74 
X, 
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TABLE A2 


SoME THEORETICAL CORRELATIONS UNDER A 
SpEARMAN Unit-Factor INTERPRETATION 


Trial X: X: Xs X. 
Xi .63 .64 .66 
Xa -10 12 
Xs 43 
Xi 


lower right-hand corner and the smallest 
in the upper left-hand corner. 

The matrix in Table A1 displays this 
second form of the monotonic hierarchy. 
On a purely qualitative basis, therefore, 
the hypothesis of a Spearman unit factor 
is tenable. But before we can accept 
this hypothesis as a possible explanation 
of the correlations, we must first specify 
values for the factor loadings and at- 
tempt a fit. 

Under a Spearman hypothesis the 
empirical correlations: 


rij © aid; 


where a; is the loading on the one com- 
mon factor of the ith variable in the 
matrix. It follows that: 


t113 i 
a =~ | — 
t23 


a, = (carey 
T'(n-1n 

In other words, working with the two 
"ends" of the matrix we obtain two 
tentative values for the first factor load- 
ing, aj. If we compromise these two 
values, we will get a “balanced” single 
value, which, in this instance, equals .76. 
Then, since: 


we can go ahead and determine the ten- 
tative values for the remaining variables. 
In the case at hand we get these values: 


d 56 a; = .84 
as = .83 as = .87 


Generating the theoretical correlations 
from these parameters we get the matrix 
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idis 
i ble A2. Ordinarily, 
ich appears in Ta 5 
uon BE to improve upon the T "M 
adju, ting the a;, but in this instance £ 
d iE about perfect as n Rr gap 
i tion is .01, 
iggest residual correlation 0 
Ea EA one is that big. It's gur 
Beat therefore, that the cpt a pe 
S ined as originating in a single co 
mon factor. 


AUGMENTED SPEARMAN 
Unit FACTOR 


ugmented Spearman unit 
pane Tae two-factor ee in 
which the loadings of all the el 5 on 
one of the two factors are equal. um 
relationally, the effect of this secon 
factor is to add a constant quantity, 
which equals the loading on this second 
factor squared, to all of the correlations 
in the matrix. Now, consider the ma- 
trix which appears in Table A3. > Quali- 
tatively, the pattern it displays, ie., the 
monotonic hierarchy, is consistent with 
an augmented-Spearman interpretation, 
It remains to be seen, however, whether 
or not a good quantitative fit is possible, 
In fitting an augmented Spearman hy- 
pothesis our first task is to determine the 
magnitude of the one loading on the 
second factor. Consider, therefore, the 
three "principal points" of the matrix, 
i.e., the upper left-hand corner, the upper 
right-hand corner, and the lower right. 
hand corner. Under a nonaugmented 
Spearman hypothesis: 
Pit)» — Tinton) zz 0) 
If this quantity is either negative or zero, 
then the fit cannot be improved by means 
of the additive constant (which 


thé must be 
positive). If this difference is Positive, 
TABLE A3 
ANOTHER Monotone HrERACHY 
Trial xi X: Xi Xi 
Xi .88 17 n 
X: 13 -67 
X; 63 
X4 


im 
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.. TABLE A4 


Some THEORETICA‘s, CORRELATIONS UNDER AN 
AUGMENTED SPEA RMAN UNIT-FACTOR 
INTERPRETATION 


Trial xi X: X f ] x. 
X, 88 | 37 | m 
X. 74 69 
x KZE 
X, 

however, 


» as in the present instance, then 
by subtracting a fixed positive amount 
from all the correlations in the matrix, 
the theoretical Correlations can be 
rought more nearly into line with their 

empirical counterparts, Now, if we 
10 from all the correlations in 

the matrix, the tetrad difference between 
cipal points temains positive, 


i however, the differ- 
ence becomes negative, 


Suppose, there- 
fore, that we take .50 a 


ond factor squared, If 
all of the co i 
fita Spearm: 


y= a=, 


Q = 


which generate 
appear in Table 
1$ no need 
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TABLE A5 
A 'THREE-CORNERED PATTERN 


Trial} Xi | Xa | Xs | Xa | Xs | Xe | Xz | Xs 


A 88 | .77 | .71 | .69 | .70 | .72 | .74 
ee 13 | .67 | .65 | .67 | .69 | .70 
E ‘63 | .62 | .64 | .65 | .67 
a .60 | .61 | .63 | .64 
ae 63 | .64 | .66 
x 70 | .72 
x 74 


first factor: 
Qi = Q = as = 04 
and the loadings on the second factor: 


Bs = Bs = B: = Bs 


For the rest of it: 


as € as a; € as 


B12 Bre Bsz Bs 


Rc as in the full-Woodrow structure. 
Boc aHonally, however, the two 
ures are very different. The full- 
Msrow generates a superdiagonal 
" a while the half-Woodrow generates 
Bre ree-cornered pattern. Table A5 
oa NM an eight-variable example. The 
a ations are high in the three corners 
e matrix and drop off as one moves 
ae any one of them toward the middle 
the last column, the middle of the first 
Tow, or the middle of the superdiagonal. 
aes three-cornered pattern is naturally 
lvided into three submatrices: the two 
peat triangles, which contain the cor- 
Clations within the two halves of the 
Eb and the central block, which con- 
ins the correlations between them. In 
the two apical triangles the correlations 
are generated by augmented Spearman 
ypotheses. So, we already know how 
Ent these correlations. In fact, we 
of ny have. 'The two apical triangles 
i able A5 are the same two monotonic 
frarchies which we treated in the first 
rik, Sections of this note. Therefore, we 
ady have tentative parameters for 


and 


the half-Woodrow structure, namely: 


a =.71 fi = .66 


as = .76 B: = 58 
ag = .83 B: = 41 
a; = .84 Bs = .32 
as = .87 B =.00 


where æ represents the first four loadings 
(all equal) on the first factor and B rep- 
resents the last four loadings (again, all 
equal) on the second factor. 

These loadings, of course, were derived 
exclusively from the two apical triangles. 
To this point we have taken no account 
of the correlations in the central block. 
To do so, however, we need only generate 
the block correlations from our prelimi- 
nary loadings and see what sort of a fit we 
get. If the fit is good, our preliminary 
loadings become final. ]f it isn't, we 
must adjust the factor loadings so as to 
compromise the errors which they gener- 
ate in the apical traingles and in the 
central block. In the case at hand the 
fit is not good. But the remedy is easy. 
All we need do is assign B a somewhat 
higher value than zero. If we let 8 equal 
.20 and keep all the other loadings just 
what they were, we get the theoretical 
correlations which appear in Table A6, 
and a good fit. Of course, it might not 
have been possible to fit the central block 
and the apical triangles with a single set 
of loadings. If this had been the case, 


we would have had to reject the half- | 


Woodrow hypothesis. 


TABLE A6 


Some THEORETICAL CORRELATIONS GENER- 
ATED BY A HALF-WOODROW Two- 
Factor STRUCTURE 


Trial] X1| Xs | X: | X« Xs | Xo | X2 | Xe 


Xi 88 | .77 | -71 | .67 | -72 | .73 | .75 
Xs 774 | 69 | .66 | .71 | -71 | .73 
X; “63 | .62 | .67 | .68 | .70 
X .60 | .65 | .66 | .68 
au 67 | .68 | .70 
Xe ‘74 | .76 
Xi 77 
Xs 
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FurL-WoopRow Two- 
Factor STRUCTURE 


Fitting a alee ie E 
n one fact: If a correlation ma- 
med from a full-Woodrow 
structure, it is always possible to trans- 
form the correlations into a three- 
cornered pattern and to fit the trans- 
formed correlations with a half-Woodrow 
structure. Suppose, for example, that 
we multiply the loadings a; and b; in a 
full-Woodrow structure by k;, where: 


for the first 1/2 variables and 


> 


ky = 28 


bi 


for the last 1/2 variables. The result 
will be a half-Woodrow Structure, with 
the loadings (in the eight-variable case): 


b 
[mn as = (P) Bib Bs = b. 
5 


bs 


a2 = d a= a( 2) & - a (**) Bs = bs 
a3 = d ar = or( %) & = by( 2) B; = bs 


Came ai p= (2) Bs = bs 


And the correlations which this new 
structure generates, call them rz, will 
bear a simple relationship to the correla- 
tions of the original Matrix, namely: 


as = ds 


rij = rijkik; 


It follows that to fita full-Woodrow struc- 
ture we must do three things: transform 
the superdiagonal form into a three- 
cornered pattern, fit the three-cornered 
pattern with a half-Woodrow, and then 
transform the loadings for the half- 
Woodrow back to the original full-Wood- 
row structure. 

Our first task, then, is to find values 
for the k; which are inversely propor- 
tional to the full-Woodrow loadings, a, 
4» Gs, da, and bs, be, b; by The word 
"proportional" is important, because it 
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means that we have a great deal of lati- 
tude in transforming the superdiagonal 
form; more specifically, it means that 
the transformed correlations may be big, 
small, or middling, without any loss of 
rigor. As long as the new correlations 
can be fit by a half-Woodrow structure, 
whatever their absolute magnitude may 
be, then we can fit the original correla- 
tions with a full-Woodrow structure. 
But even so, how do we find a set of ki 
which will do the trick? 

Consider the correlations in the central 


block of a three-cornered pattern, They 
take this form: 


Xs Xe X; Xs 


Xx (nes B8) (aees+861) (cert 6B1) (aas - B8 
Xs (ncs -BBs) (ns--88:) (ear-+fBs) (aas--B82) 
Xs (aas+-883) (ace-+As) (aas-Ef;) (aes--B83) 
X, (254-881) (aos--88:) (ao;--88:) (nas -B82 


Which the reader will immediately rec- 
Ognize as a perfectly additive linear 
pattern, as these terms are used in analy- 
sis of variance. What we seek, there- 
fore, are values for the k; which will trans- 
form the central block of the superdiag- 
onal form into this pattern, Now, we 


know from the nature of the Woodrow 
Structure that: 


os [TR as Qs 
and 


Bi= 8:2 B1» B, 


Which means that th 


central block will be in the upper right- 
hand corner. The correlations will fall 
off as we go either to the left or down, 
until we come to the smallest rij’ in the 
central block, which will be in the lower 
left-hand corner. We also know, how- 
ever, that the correlations in the central 

lock of a Superdiagonal form have just 
the opposite pattern: The biggest corre- 
lation is in the lower left-hand corner 
and the smallest in the upper right-hand 
corner. It follows that the multiplica- 


üve constants, b; must obey these in- 
equalities: 


€ biggest r;' in the 


and 


Em——— pee 


PRACTICE AS A PROCESS OF SIMPLIFICATION 


nus the first four constants are multi- 
red only into the rows of the central 
E and the last four only into the 
E. ER Knowing. this much there is 
E difficulty in transforming the 
1 ions so that they have the pre- 
Scribed pattern. 
fw the correlations in Table AS, for 
that a By now it is quite obvious 
derived 1 correlations. in Table AS were 
the 2m the original correlations for 
us and Coordination Test. To 
ie the correlations of Table A5 
blied optan ri in Table 1 was multi- 
by k; and again by k; where: 


ky = 147 ks = 81 
ke = 95 ke = .86 
ky = .86 k: = .88 
k= .81 ks = .90 


age ve to do now, therefore, is to 
Stich he loadings of the half-Woodrow 
spondi re, the o; and f;, by their corre- 

ing ky; and we have our tentative 


Par, ss 
ét ameters for the original full-Woodrow 
Tucture, viz.: 


Saige ap = 94 by =.56 bs = «25 
E as = 97 bi = 61 be = -23 
E prs 4; 2.95 bs = 48 07 = .23 
17,88 ag=.97 by = .40 bs = -22 


Il 


These values do not conform entirely to 
out tenents of the full-Woodrow 
nor ES b; should not be less than bs; 
üt, Td à; be less than as. , Also, the 
eS ough good, can be improved. 

eed we can make adjustments for 

oadi things; and when we do, we get the 
ngs which are cited in the text and 


e i 5 
Correlations which are reproduced in 
able 2, 


DouBLE ACCUMULATION 
STRUCTURE 


a ^ a correlation matrix is generated by 
ouble accumulation structure, then: 


Prt (rij rin) (rin — ri) 


It € 
Saat fit is perfect, the relationship is 
In other words, the correlations 
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generated by a double accumulation 
structure have the same additive linear 
pattern which we remarked in the central 
block of the half-Woodrow pattern. 
This circumstance makes it possible for 
us to fit the correlations under a double 
accumulation hypothesis without ever 
specifying the parameters of the struc- 
ture, i.e., the variances of the factors. 
Let’s take a few correlations from the 
matrix in Table 1 and see how well they 
satisfy this linear, additive relationship. 
We have: i 


.89 


LI 


reg = 87 = 70 4- C07 + .12) 


ru = 91 = .70-+ C04 4- .06) = .90 
ras = 91 70 + (03 + .18) 
ry = 85 = .70 + (01 + 16) = .87 


It’s quite obvious from these relation- 
ships that Table 1 can be accounted for 
as a double accumulation structure. All 
we have to do is to treat the correlations 
in the first row and last column as theo- 
retical correlations and adjust them until 
we get a good fit, which, in the present 
case, is hardly much of a chore. Once 
they are made, we have the theoretical 
correlations, pij, which appear in Table 
3, of the text. 

It remains, of course, to determine the 
variances of the factors. In a double 
accumulation structure the first and last 
variables in the matrix have only one 
factor in common and that one factor is 
common to all of the variables in the ma- 
trix. It follows that the variance of this 
one factor equals ris. Now, apart from 
this general factor, a double accumula- 
tion structure contains 2(” — 2) other 
factors, which are divided into two ac- 
cumulation structures, each one con- 
taining (#7 — 2) factors. It happens, 
however, that the theoretical correla- 
tions in the first row of a double accumu- 
lation pattern are determined by one of 
these structures only, and the theoretical 
correlations in the last column are 
determined by the other, exclusively. 
This means that we can reel off the factor 
variances in a very simple way, once we 
know the theoretical correlations. If the 
factors are numbered the way they are in 
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FurL-WoopRow Two- 
Factor STRUCTURE 


Fitting a Ep pepe eve ie 
one fact: If a correla = 
eee from a full-Woodrow 
structure, it is always possible to trans- 
form the correlations into a three- 
cornered pattern and to fit the trans- 
formed correlations with a half-Woodrow 
structure. Suppose, for example, that 
we multiply the loadings a; and b; in a 
full-Woodrow structure by k;, where: 


for the first 2/2 variables and 


~ 


ka= Z 


bi 


for the last 2/2 variables. The result 
will be a half-Woodrow Structure, with 
the loadings (in the eight-variable case): 


b 
e — d as = a(t) Bi = by Bs = bs 
m= a =a) g = 4,(% Bs = bs 
bs ay 
a=a a= a(t) & - (8) Br = bs 
0/74 a = ag & (2) Bs = bs 
4 


And the correlations whi 
structure generates, call t 
bear a simple relationship 
tions of the original matri 


ich this new 
hem 7;;', will 
to the correla- 
X, namely: 


fij! = rkih 


It follows that to fit a full-Woodrow struc- 
ture we must do three things: transform 
the superdiagonal form into a three- 
cornered pattern, fit the three-cornered 
pattern with a half-Woodrow, and then 
transform the loadings for the half- 
Woodrow back to the original full-Wood- 
row structure. 

Our first task, then, is to find values 
for the k; which are inversely propor- 
tional to the full-Woodrow loadings, a,, 
an Q5, ds, and bs, be, bz, bs. The word 
"proportional" is important, because it 
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means that we have a great deal of lati- 
tude in transforming the superdiagonal 
form; more specifically, it means that 
the transformed correlations may be big, 
small, or middling, without any loss of 
rigor. As long as the new correlations 
can be fit by a half-Woodrow structure, 
whatever their absolute magnitude may 
be, then we can fit the original correla- 
tions with a full-Woodrow structure. 
But even so, how do we find a set of ki 
which will do the trick? 

Consider the correlations in the central 


block of a three-cornered pattern, They 
take this form: 


X; Xo Xi Xs 


X1 (aas- 881). (as-4-88:) (car +681) (aas-- BB) 
Xa (ao54-88:) (aos -88:) (aa;--88») (aas--88:) 
Xa (cees-+68:) (ans -885) (oar-+68s) (aas BB) 
Xa (aos--88)). (aes -88,) (aar+A8s) (nos -BB2 


Which the reader will immediately rec- 
ognize as a perfectly additive linear 
pattern, as these terms are used in analy- 
sis of variance. What we seek, there- 
fore, are values for the k; which will trans- 
form the central block of the superdiag- 
onal form into this pattern. Now, we 


know from the nature of the Woodrow 
structure that: 


os ax e; € as 
and 


Bi> 8:7 B3 p, 


Which means that the 
central block will be in the upper right- 
hand corner. The correlations will fall 
off as we go either to the left or down, 
until we come to the smallest r;;' in the 
central block, which will be in the lower 
left-hand corner, We also know, how- 
ever, that the correlations in the central 
block of a superdiagonal form have just 
the opposite pattern: The biggest corre- 
lation is in the lower left-hand cornet 
and the smallest in the upper right-hand 
corner. It follows that the multiplica- 


tive Constants, b; must obey these in- 
equalities: 


biggest r; in the 


and 
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P the first four constants are multi- 
Prem only into the rows of the central 
Ee and the last four only into the 
so on Knowing this much there is 
E difficulty in transforming the 
T ions so that they have the pre- 
Scribed pattern. 
B T the correlations in Table AS, for 
ia pe By now it is quite obvious 
detects correlations in Table AS were 
the T. rom the original correlations for 
i and Coordination Test. To 
Perm ES the correlations of Table A5 
EE rrelation rij in Table 1 was multi- 
by k; and again by k; where: 


ky = 147 ks = 81 
ke = 95 ke = .86 
ky = .86 .88 
R= st 90 


Mme oe to do now, therefore, is to 
cm loadings of the half-Woodrow 
Sonik e, the a; and fi, by their corre- 

ing ky; and we have our tentative 


par; " ha: 
3 ameters for the original full-Woodrow 
ructure, viz.: 


aq= 

s - 61 as=.94 b = .56 bs=.25 

E le 45 ag = .97 by = 61 b.—.23 
17,83 a, =.95 by = 48 br = 23 


% = 88 a,=.97 by = 40 bs = -22 


pee values do not conform entirely to 
"(sagas per of the full-Woodrow 
nor E b; should not be less than bs; 
fit as a; be less than ae... Also, the 
es ough good, can be improved. 
Dee we can make adjustments for 
oadi things; and when we do, we get the 
a ngs which are cited in the text and 
Correlations which are reproduced in 
able 2. 


DounLE ACCUMULATION 
STRUCTURE 


I H H 
a K a correlation matrix 1$ generated by 
ouble accumulation structure, then: 
Tür rint (ririn) + (ri r9) 


If the 


Cxact fit is perfect, the relationship is 


In other words, the correlations 
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generated by a double accumulation 
structure have the same additive linear 
pattern which we remarked in the central 
block of the half-Woodrow pattern. 
This circumstance makes it possible for 
us to fit the correlations under a double 
accumulation hypothesis without ever 
specifying the parameters of the struc- 
ture, i.e., the variances of the factors. 

Let's take a few correlations from the 
matrix in Table 1 and see how well they 
satisfy this linear, additive relationship. 
We have: 


ra = .87 = .70 + (07 + .12) .89 
ra = 912 70+ (.04 + .06) = .90 
912 .70 + (03 + .18) 

‘85 = 70 + (01 + .16) = .87 


LU 


fas 


I 


t37 


It's quite obvious from these relation- 
ships that Table 1 can be accounted for 
as a double accumulation structure. All 
we have to do is to treat the correlations 
in the first row and last column as theo- 
retical correlations and adjust them until 
we get a good fit, which, in the present 
case, is hardly much of a chore. Once 
they are made, we have the theoretical 
correlations, pii; which appear in Table 
3, of the text. 

It remains, of course, to determine the 
variances of the factors. In a double 
accumulation structure the first and last 
variables in the matrix have only one 
factor in common and that one factor is 
common to all of the variables in the ma- 
trix. It follows that the variance of this 
one factor equals 7i». Now, apart from 
this general factor, à double accumula- 
tion structure contains 2(n — 2) other 
factors, which are divided into two ac- 
cumulation structures, each one con- 
taining (#7 — 2) factors. It happens, 
however, that the theoretical correla- 
tions in the first row of a double accumu- 
lation pattern are determined by one of 
these structures only, and the theoretical 
correlations in the last column are 
determined by the other, exclusively. 
This means that we can reel off the factor 
variances in a very simple way, once we 
know the theoretical correlations. If the 
factors are numbered the way they are in 
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the text, we get: 


and 


os = pis — pos 


From these formulae I derived the values 
of the øz? which appear in the text. 


SIMPLEX 


In a Woodrow structure the param- 
eters are all of the same sort, namely 
factor loadings; similarly, in the double 
accumulation structure, the parameters 
are also of the same sort, in this case 
factor variances. But in the simplex 
there are two different kinds of structural 
parameters. Under a simplicial hy- 
pothesis, the theoretical correlations: 


: G <j) 

i 

where the c, are the complexities and the 
hj? are the communalities. This para- 
metric duality creates a corresponding 
duality in the fitting procedures: We can 
specify the communalities first or we can 
specify the complexities first. The older 
method is to specify the communalities 
first. So I will begin there. 


Communality Method 


Using the communality method we fit 
a simplex in the following manner. 
First, we specify tentative values for the 
communalities. Then, the correlations 
are corrected for attenuation (by the 
communalities) and the resulting, unat- 
tenuated correlations fit with a perfect 
simplicial form.^! If the unattenuated 
correlations won't submit to a simplicial 
fit, it may be that the variables which 


A The term “perfect simplicial form" is 
defined on the next page. Fora fuller dis- 
cussion of the simplex and its terminology, 
see Jones (1959, pp. 11-15). 
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enter into the matrix are not structured 
simplicially. It may also be that we 
used the wrong communalities. So we 
adjust the communalities. If, by these 
adjustments, we can bring the unattenu- 
ated correlations into simplicial form or 
an approximation to it, we have our fit. 
If we can't, then the simplicial hypothe- 
sis must be rejected. 

It's quite obvious from this account 
that a great deal hinges on the original 
choice of communalities. If the matrix 
has simplicial form, then sooner or later 
we will find a fit. But if the values we 
posit for the communalities are suffi- 
ciently far off, it may be much later. It 
would be nice if we had some way of 
knowing at the beginning what these 
communalities would turn out to be. 
Unfortunately, in advance of trying 
them out, the best we can do is to specify 
limits for the communalities. It woul 
not be reasonable, for example, to set 
the communality of a variable higher 
than its known reliability. Nor woul 
it be resonable to set it so low that the 
correlation between two variables, after 
correcting for the communalities, became 
greater than unity. But within these 
limits there is no way of knowing in 
advance what the best-fitting values for 
the communalities are. 

Nevertheless, we can still posit values 
for the communalities and try them out. 
And thisisjust what wedo. In practice, 
there are three basic ways of proceeding: 
One, we can set the communality of the , 
ith variable equal to the highest correla- 
tion it has in the matrix. Two, we can 
equate the communalities with one kin 
or another of reliability (if we have anY 
information on this count). Or three 
we can set the communalities all equal to 
a given value, .90, say, or .80. No one o 
these procedures has any a priori claim 
to being right. But they do represent 
so many ways of approaching the pro?” 
lem, and in my experience they are g0° 
ways. I have fit a great many simplexes 
and one or another of these approaches». 
with minor adjustments, has always le 
to a good fit, if one existed. d 

In fitting the matrix for the Two-Hanc 
Coordination Test I set the communali- 
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ties of the trials equal to the largest cor- 
relation they had in the matrix. The 
next step was to correct every correlation 
in the matrix for its attenuation by the 
remaining, specific variance. In other 
words, I divided r;; by Nhah?. The ma- 
trix of unattenuated coefficients which 
resulted appears in Table A7. 

Now, once the correlations have been 
Corrected for attenuation the equations 
for the simplex become: 


mo = fiH fatt fea fs 
m o- 0 fateh teat te 
EL frat fa 
Xn = fa 


or just what they were originally, except 
that the specific factors are gone. In 
this form, the simplex lays down very 
demanding requirements on the correla- 
tions, If the unattenuated correlations 
fij’, really do form a simplex, then they 
uet obey the fundamental law of ‘‘per- 
ect simplicial form," i.e.: 


TU = rij’ Pik’ 


For example, if rı? = .90 and ras’ = .80, 
then. "i! must equal .72. Clearly, if 
this is the case, the matrix will have the 
Superdiagonal form—and then some. 

Of course, in practice the unattenu- 
ated coefficients never do have simplicial 
form, not exactly. Take the correlations 
of Table A7, for example. 71: times rex 
equals .93, which almost equals ris’ = 


TABLE A7 
UNATTENUATED EMPIRICAL CORRELATIONS 
FOR THE Two-HaND COORDINATION TEST 
IN THE COMMUNALITY METHOD 


Tis | xs | xs | Xe | Xe | Xe | 28 | Xe 
Xi| [9s| 91 | .87j -86) 83 | -83| .81 
x: "og | 98 94] 91 | 91| 91 
x 100 .98| .95 | .92| .93 
X 100| .96 | .93| .96 
Xs ‘97 | 98] .98 
es | 1.00| 1.00 

x 1.00 
x | 
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TABLE A8 


UNATTENUATED THEORETICAL CORRELATIONS 
FOR THE Two-HAND COORDINATION TEST 
IN THE COMMUNALITY METHOD 


Trial| Xi| X: | X3 | Xe | Xs | Xe | X: Xs 
Xi .93 | .90 | .88 | .86 | .83 | .83| .83 
Xs 97 | .95 | .93 |.90 | .90| .90 
Xi 98 | .96 |.93 | .93| .93 
X; .98 | .95 | .95| .95 
X; 971 .97| .97 
Xe 1.00) 1.00 
X: 1.00 
Xs 


91, but not exactly. Our task, there- 
fore, is to find a theoretical simplicial 
form which comes as close as possible to 
the unattenuated pattern. In the case 
of Table A7 this task is easy. The cor- 
relations, which appear in Table A8 have 
perfect simplicial form, and, as the reader 
can see, they constitute a very close ap- 
proximation to the unattenuated empiri- 
cal correlations. 

To obtain the theoretical correlations, 
Cii which I presented in Table 4 of the 
text I simply multiplied the correlations 
in Table A8 by the same communalities 
which I used to unattenuate the empiri- 
cal correlations. . 

But what about the complexities? 
Here we are, with the pi; already derived, 
and we still haven't said anything about 
the complexities. The answer, of course, 
is that, without realizing it, perhaps, we 
have already posited the complexities. 
The complexity of the ith trial is nothing 
other than its theoretical correlation, 
before attenuation, with the first trial in 
the series; in other words, cj = Cii. In 
the case at hand, therefore, the complexi- 
ties are the correlations in the first row 


of Table A8.^? 


A2 The virtue of the complexities is that, if 
we know them, then by the fundamental law 
of perfect simplicial form we know all of the 
cij and can, therefore, if we also know the 
communalities, generate all of the theoretical 
correlations pij. However, the complexities 
are not alone in possessing ‘this property. 
The gradations, ie, the correlations #41, 
are also capable of generating the entire set 
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Covariance Method^? 


The full equations for the simplex 
(with progressive simplification) are: 


xi = Ri(fit---+ fat si) 


where the k; are arbitrary constants. In 
the usual presentation of the simplex, as 
in the text of this paper, these constants 
are dropped, since they are of no con- 
sequence as long as we have to do with 
correlations only. The moment, how- 
ever, that we concern ourselves with 
variances or with covariances we must 
dredge these little things up and be most 
meticulous about them. 

Under a simplicial hypothesis, the co- 
variances between the variables: 


covi; = kikilos? + - oy) 
Suppose, now, that we multiply all of 
C A ko 
the entries in the ith row by A where 


ko is another arbitrary constant. The 


new entries: 
covij = kokios -+ op 


And if we multiply the columns by the 
same quantities, we get the entries: 


cov; = kloz? Te “+ os, 


From here we get the unattenuated, the- 
oretical correlations cj; immediately, 
because: 

dp (ore seb eps) 


" ko 
The only problem is to find the E 


i 


of c; and so are several other subsets of the 
cj. For a discussion of this point, see Jones 
(1959, p. 12). . 

43 The first person to mention to me the 
possibilities of the covariance matrix in fitting 
the simplex to a correlation matrix was Ardie 
Lubin. My original reaction was very nega- 
tive and in the “Simplex Theory (ones, 
1959, pp. 79-80) I had some tamer i ard 
things to say about his suggestione, „have 

o course now, therefore, but to apo logize to 
Wn for my intemperance: he was right and I 


was wrong. 
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TABLE A9 


THE COVARIANCE MATRIX FOR THE 
Two-Hanp COORDINATION TEST 


Trial} Xi | X2 | Xs | Xi | Xs | Xo | Xs | Xs 


X 323| 331 | 329 | 376 | 353 | 360 | 375 
X: 477 | 484 | 546 | 510 | 523 | 528 
Xi 536 | 617 | 575 | 580 | 584 
Xi 640 | 593 | 598 | 613 
X; 702 | 726 | 729 
Xe 718 | 722 
Xi 751 


Xs | 


This problem, happily, is easily solved. 
Once we have multiplied through the 


ko A 
rows by the y then all of the entries in 
i 


any one column are equal. So suppose 
we set all of the entries in the last column 
equal to some arbitrary amount A, by 
multiplying the rows by: 


A ko ( A ) 
s/p 
COVin — hi kay, 


If the simplicial hypothesis is correct, 
then all of the entries in any one of the 
other columns should be equal too, or 
close toit. Ifthisis not the case, we can 
reject the simplicial hypothesis. If it is, 


k 
however, we have the i we sought. 


i 

In Table A9 the covariance matrix for 
the Two-Hand Coordination Test is pre- 
sented. Setting all the entries in the last 
column equal to 300 (except for rounding 


k 
errors), we get these values for our =. 


ko k k k 

— = g4 LL A ers 2 = 40 
A E 51 ps E Fs 

ko k k 

fu gy Fel. S 

x R9 boy 

Note that we get no value for ie When 


8 
the rows are multiplied by these values, 
We get the matrix which appears in Table 
A10. And since the columns conform 
approximately to the simplicial require- 
ments, we know that we are on the right 


| 
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TABLE A10 


THE COVARIANCE MATRIX FOR THE Two- 
HAND COORDINATION TEST AFTER MULTI- 


k 
PLYING THROUGH THE ROWS BY E 
i 


Trial Xi| Xa | Xa | Xe | Xs | Xe | Xs | Xs 


X 271| 278 | 276 | 316 | 297 | 302 | 300 
X: 272 | 275 | 311 | 291 | 298 | 301 
Xi 273 | 315 | 293 | 296 | 298 
X. 314 | 291 | 293 | 300 
Xs 288 | 298 | 299 
ži 302 | 303 
7 
a 300 
track. Now, multiplying through the 


columns by these same values we get the 
matrix which appears in Table A11. 
And from this table, taking the average 
down the columns, we get: 


kj (ogg +++ ++ ogg) = 154 
ke (og +++ ogg) = 141 
ke (oy? +++ oy?) = 135 


klore H+ ogg) = 129 
Kj (ord bor? + aj) = 123 
klor? + org) = 119 


From these figures we immediately ob- 
tain the matrix of theoretical, unattenu- 
ated correlation coefficients (minus the 
first row and the last column) which ap- 
Pears in Table A12. It would have made 


TABLE All 


THE COVARIANCE MATRIX FOR THE Two- 


HAND COORDINATION TEST AFTER MULTI- 
PLYING THROUGH THE ROWS AND THE 


ko 


COLUMNS BY 5, 


Triall Xi| xi | xi | Xe | Xs | Xe | Xz | Xs 


X,| | 45a] 142 | 135 | 130 | 125 | 121 


X; 139 | 135 | 128 | 122 | 119 
Xi 134 | 129 | 123 | 118 
X, 129 | 122 | 117 
X, 121 | 119 
Xs 121 
Xi 

X, 


no difference, by the way, if we had 
equated the rows first and then multi- 
plied through the columns. We would 
have ended up with the same values for 
the cij. 

Let's not worry for the time being 
about the first row and the last column 
and proceed directly to the communali- 
ties. If a simplicial fit exists, we know 


that: 
Tij & 6; Nhoh? 


If, therefore, we divide the r;; by our al- 
ready obtained c;; the resulting matrix 
should be a Spearman unit hierarchy, 
where the loadings for the several vari- 


TABLE A12 


UNATTENUATED THEORETICAL CORRELATIONS 
FOR THE Two-HAND CooRDINATION TEST 
IN THE CovARIANCE METHOD 


Xs 97 | .96 
Xe .98 
X: 
Xs 


ables equal Vie. Carrying out this 


process we find that: 
he = .92 
he = .94 


he = .94 
h? = 94 


he = .92 
h? = .92 


It only remains to fill things in for the 
first and last variables. 
The correlations in the first row: 


Ti C1209; Vhh’ 
nj = hes Nh? 


1, = Cia Vie 


or 


if we set 


In other words, given what we already 
know, i.e., cs; and Vh’, for2 janm 
the empirical correlations rı; are gener- 
ated by a single parameter. In the same 
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way, the empirical correlations in the last 
column: 

Tin = lcin- Vhè 
where is 

In = Caan Vhr? 


and finally 
Tin = lilnc27 


All we have to do, therefore, is to obtain 
tentative values for /; and /,. The most 
natural way to do this is to set: 


fx 
1, = TDs 
Vin? 


and then solve for J; and Zp. 
ent case, we get: 
lı = .82, 
ls = .97 


Altogether, then, we can pick any values 
we like for ci» and h,?, provided only that: 


In the pres- 


c12 Vh? = .82 


And in the same way we can pick any 
values we like for cz and hg, provided 


only that: 
cnm Vhè = .97 


Suppose, therefore, that we set: 


C12 = .90 Cig = .99 
h? = .81 h? = .96 
Remembering that the complexities are 


simply the c;j, we now have both our com- 
munalities and our complexities, and we 
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TABLE A13 


ATTENUATED THEORETICAL CORRELATIONS 
FOR THE Two-HaNp COORDINATION TEST 
IN THE CovARIANCE METHOD 


Trial) Xı| X: | Xa | X. | Xs | Xe | Xz | Xs 


Xi 9| .75 | .73 | .72 | .70 | .69 | .69 
X: -88 |.86 | .86 | .83 | .82 | .82 
Xi 90 | .89 | .86 | .86 | .86 
X, 91 |.88 | .87 | .87 
X; 91 | .90 | .90 
Xs .92 | .92 
Xi 94 
Xs 


can generate our theoretical correlations. 
The pi; for the Two-Hand Coordination 
Test appear in Table A13, and, as the 
reader can see, we have a very good fit. 
Also, if the reader will check the param- 
eters obtained by this method with 
those obtained by the communality 
method, he will see that they are essen- 
tially the same. 

In conclusion, let me say a word or 
two about the comparative merits of 
the communality and the covariance 
methods. Ordinarily, the communality 
method is quicker and involves many 
fewer computations. But it can go 
awry. If the communalities of the vari- 
ables happen not to coincide, even ap- 
proximately, with the tentative values 
Which are given us by one or another of 
our three methods of approach, we can 
waste an awful lot of time looking around 
for a fit. The covariance method, in 
comparison, is sure. If a fit exists. you 


can always get it with the covariance 
method. 


i 
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NORMATIVE AND IPSATIVE MEASUREMENT 
IN PSYCHOLOGY: 


DONALD M. BROVERMAN 


Worcester State Hospital, Massachusetts 


In an earlier paper (Broverman, 
1961) certain differences between fac- 
tors obtained from standard R factor 
analysis methods (correlating tests) 
and Q factor analysis techniques (cor- 
relating people) were illustrated and 
attributed to the different types of 
Scores used by the two techniques. 
The usual R analysis uses "normative" 
Scores, or scores which reflect devia- 
tions of individual performances from 
the group's mean level of performance; 
while Q factor analysis typically uses 
ipsative" scores, or scores which re- 
flect deviations of particular perform- 
ances from each individual's mean level 
of performance. Normative scores, 
then, are distributed about a norm 
Characterizing a group, while ipsative 
Scores are based on norms of indi- 
viduals. 

The present paper will attempt to 
show that normative and ipsative pos- 
Sibilities of measurement occur not 
Only in factor analysis, but, also, in 
most efforts to quantify behavior, 
whether in purely theoretical terms or 
in actual experimental situations. Ac- 
Cordingly, the assumptions involved in 
each mode of measurement, and exam- 
Ples of these assumptions in current 
Psychological theories will be pointed 
Out. The paper will then examine the 
manner jn which standard statistical 


1 Preparation of this paper was facilitated 
y the Dementia Praecox Research Project, 
orcester State Hospital, and research 
Brant (M-896), from the National Institute 
Of Mental Health, United States Public 
ealth Service. The writer prepared this 
Paper while a United States Public Health 
9stdoctoral Fellow in Clinical Psychology. 


procedures cope, or fail to cope, with 
these different aspects of behavior. 
With respect to this last point, the 
classical correlational procedures will 
be shown to confound normative and 
ipsative aspects of behavior. Hence, 
the final portion of this paper will pre- 
sent correlational techniques for deal- 
ing with these distinctions. 


ASSUMPTIONS OF THE NorMATIVE 
MEASUREMENT ORIENTIATION 


The normative measurement orienta- 
tion assumes that behavioral variations 
between individuals may be ordered on 
parameters which exist within the 
species' universe of variation. Thus, 
normative measurement procedures 
distribute scores about the norm ofa 
group, sample, or population. In 
normative measurement, then, the be- 
haviors of individuals are ordered on 
dimensions which transcend the indi- 
vidual, ie., the behavior of a given 
individual is but a point or mark on a 
normative scale. This approach as- 
sumes that certain individual differ- 
ences in behavior are due to an entity 
or trait which all members of the popu- 
lation possess in varying amounts. For 
example, Spearman (1927) assumes 
that individual differences in intelli- 
gence reflect the different amounts of 
mental energy possessed by different 
individuals. 

It should be noted that intraindi- 
vidual variation in parts of normative 
traits are not considered in normative 
measurement. Accordingly, less than 
perfect correlations between several 
manifestations of the same normative 
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trait are accounted for by assuming 
(a) that the surface behaviors showing 
the trait are phenotypically complex 
manifestations of more than one un- 
derlying genotypical, normative trait, 
and/or (b) that part of the discrepancy 
is due to measurement error. Thus, 
normative dimensions are treated as 
unitary, internally undifferentiated 
variables whose variation occurs across 
individuals. 


ASSUMPTIONS OF THE IPSATIVE 
MEASUREMENT ORIENTATION 


The ipsative measurement orienta- 
tion, on the other hand, assumes that 
each individual is, himself, a universe 
within which variations in behavior 
occur. Accordingly, the different be- 
haviors of an individual are ordered on 
dimensions or scales within each indi- 
vidual, e.g., the habit strengths of dif- 
ferent responses in an individual may 
be scaled and compared. Particular 
behaviors of the individual, then, are 
assigned marks or scores on ipsative 
scales. 

The ipsative measurement approach 
frequently assumes that relationships 
between psychic entities in an indi- 
vidual determine his behavior. For in- 
stance, Spence (1954) argues that 
when incompatible response tendencies 
are simultaneously aroused in an indi- 
vidual, the response possessing the 
strongest habit strength will be the one 
the individual will make. 

It should be noted that often the 
ipsative orientation is not concerned 
with the unique aspects of single indi- 
viduals as in the idiographic method 
advocated by Allport (1937). On the 
contrary, ipsative relationships are fre- 
quently ascribed in theory to an entire 
population, i.e., the rules or laws [a 
tween internal psychic events may be 

generalized to all humans. Thus, it is 
rok contradictory to seek commonly 
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occurring ipsative relationships. In- 
deed, this is the goal of many personal- 
ity theories. 

The ipsative measurement orienta- 
tion, then, is concerned with quanti- 
tative relationships between psychic 
functions within individuals. It fre- 
quently assumes that such intrapsychic 
relationships play an important role in 
the determination of behavior. 


Ipsative MEASUREMENT PROCEDURES 


A property of all scales, both ipsative 
and normative, is that the scale scores 
must be in the same units. Hence, in 
ipsative measurement, the various be- 
haviors of a given individual must be 
expressed in terms of some common 
unit if scale comparisons are to be 
made. Numerous examples in psycho- 
logical theory illustrating this pro- 
cedure are available. For instance, 
Spearman (1927), who maintains that 
intelligence has certain subfunctions 
Such as eduction of relationships, re- 
production, etc., posits that a large 
expenditure of mental energy in one 
subfunction leaves less energy available 
for other subfunctions. In this ex- 
ample, the common unit of the dif- 
ferent subfunctions is mental energy. 
The energy levels of the two subfunc- 
tions, therefore, may be scaled and 
compared. Similarly Freud (1935) 
posits conflict between different psy- 
chic institutions, e.g., id versus ego, the 
outcome of which depends upon their 
respective strengths. Once more the 
common denominator is some sort of 
mental energy, 

Thus, in ipsative scaling procedures, 
different behaviors are reduced to, oF 
expressed in terms of some common 
unit, and vary around a norm estab- 
lished within the individual. The 
norm, of course, is also expressed in 
these same units. p 

Different ways of establishing ipsative 
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norms within individuals exist. One 
method is by working with single, iso- 
lated individuals. For example, in 
Stephenson's (1953) Q sort each indi- 
vidual produces his own distribution of 
responses to a set of stimuli. The 
typical unit of variation in such Q sorts 
is the degree of “likeness to me.” An- 
other example would be measures of 
physiological reactivity under different 
conditions which are compared to a 
base-rate of that activity. In these ex- 
amples, the norms are derived solely 
from a given individual's behaviors. 
What other individuals do has no ef- 
fect upon the scales or norms so estab- 
lished. Hence, this may be called a 
pure" ipsative measurement method. 

A second way of establishing ipsative 
norms within individuals starts with 
normative considerations. For in- 
Stance, the performances of a number 
of subjects might be normatively 
ranked for each test in a test battery. 
A given subject would then have a 
distribution of normative ranks—one 
rank for each of the tests in the battery. 
The subject’s general level of perform- 
ance for the battery, i.e., the subject's 
average normative rank standing, 
would then constitute the subject's 
ipsative norm and deviations of the 
Subject's normative rank standings on 
particular tests from this ipsative norm 
Would represent ipsative scores. The 
ipsative scores, in this case, are in units 
of deviations of group standings on 
various tasks from the subject’s aver- 
age group standing. Ipsative scores 
derived in this manner may be termed 
“double-centered” scores, i.e., they are 
Centered or distributed first about a 
group norm, then about a norm of the 
individual. 

"Thus, unlike either the pure ipsative 
method or the normative approach, 
the double-centered ipsative approach 
takes into consideration both normative 
and ipsative aspects of variation. 


Moves or MEASUREMENT VERSUS 
PARAMETERS OF BEHAVIOR 


Measurement operations refer to cer- 
tain ways of assigning values to events 
or data on hypothetical parameters. 
Measurement-wise, however, one may 
develop several different techniques for 
establishing such values, or scores, on 
a given hypothetical parameter. Thus, 
one may distinguish between the con- 
cept of "modes of measurement" and 
the hypothetical parameters upon 
which these measurement techniques 
are intended to produce scores. The 
lack of equivalence between particular 
measurement or scaling procedures 
and the hypothetical parameters upon 
which score values are desired may be 
illustrated by considering a situation 
where certain modes of measurement 
are inadequate for the task at hand. 
For instance, we have previously men- 
tioned that the normative measurement 
orientation does not consider intra- 
individual variations in subdomains of 
normative traits. Similiarly, the pure 
ipsative approach does not attempt to 
work with normative variations be- 
tween individuals. However, if one ac- 


cepts the notion that a given set of per- 
both 


formance scores may reflect 
normative and ipsative sources of 
variation, then both of the above 


measurement orientations become in- 
adequate as procedures for assigning 
appropriate normative and ipsative 
scale values to the data. Rather, 
some method which takes into account 
both sources of variation should be 
employed. Thus, the hypothetical 
parameters on which the data are to be 
given scale values are seen to be con- 
ceptually independent or superordinate 
to the particular manner in which the 
various marks on the parameters are 
developed. Hypothetical normative 
and ipsative behavioral parameters, 
then, represent two distinct and sepa- 
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rate areas of concern for theories of 
behavior. It is not surprising that 
theories of psychology have progressed 
down both of these avenues of inquiry. 


NORMATIVE AND IPsATIVE 
PsyCHOLOGICAL THEORIES 


Modern psychology includes both 
normatively and ipsatively oriented 
theories. For instance, Krech's (1949) 
theory that differences in intelligence 
are due to different rates of cortical 
conductivity seems, essentially, a norm- 
ative theory. The dimensions involved 
in Sheldon's (1944) body types also 
seem to be true normative parameters, 
as do the various traits of personality 
with which Allport (1937) has been 
concerned. 

However, most theories concerning 
the mechanisms of psychological func- 
tioning assume that behavior is a 
function of relationships between psy- 
chological events occurring within a 
given individual. For instance, Hel- 
son’s (1948) Adaptation Level Theory 
suggests that a perceptual response is 
determined by the relationship of the 
present stimulus to previously experi- 
enced stimuli. Similarly, Werner 
and Wapner’s (1952) Sensory-Tonic 
Theory stresses the role of the relation- 
ship of sensory to tonic factors in de- 
termining perception. Thus, the above 
perceptual theories, many learning theo- 
ries (Hull, 1943; Thorndike, 1932), 
and general personality theories 
(Freud, 1935; Lewin, 1935) all em- 
phasize the importance of relationships 
between psychic events in determin- 
ing behavior. These theories, then, 
are essentially ipsative in nature. Of 
course, many theories may include both 

rmative and ipsative considera ions. 
Pur instance, while Freud (1935) 1s 
primarily concerned with the relation- 
ships between psychic institutions, he 
also postulates normative, between in- 
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dividual differences in endowments of 
libidinal energy. Conversely, while 
Spearman (1927) is perhaps best 
known for the normative aspects of his 
theory, i.e., differences’ in intelligence 
are due to different amounts of mental 
energy, his theory also includes 
formulations about ipsative relation- 
ships between subfunctions of intel- 
ligence. These theories, and quite 
likely others, then, contain both norma- 
tive and ipsative formulations. 

It should also be noted that the 
normative-ipsative distinction is not 
represented in the language of indi- 
vidual differences. Individual differ- 
ences are usually thought of only in 
a normative sense, ie, in Anastasi 
and Foley (1949) differences between 
individuals are thought of as “a matter 
of degree. Tt is in this sense that indi- 
vidual differences are said to be quanti- 
tative rather than qualitative" (p. 59). 
However, this orientation tends to 
obscure the possibility that certain 
between-individual differences may be 
a function of varying ipsative, within- 
individual relationships, e.g., a psycho- 
analyst might argue that the differ- 
ences between psychotics and normals 
are due to different intrapsychic rela- 
tionships between ego and id. On the 
other hand, individual differences stem- 
ming from differing ipsative relation- 
ships can also be placed on normative 
Scales, e.g., the above analyst might 
also say that one patient is more or less 
psychotic than another, Terminology 
is needed, then, to distinguish those 
individual differences which are due 
solely to variations on normative 
parameters from individual differences 
stemming from differing ipsative rela- 
tionships. Hence, it is proposed that 
the purely normative individual dif- 
ferences be termed “first order” norma- 
tive differences, while those individual 
differences stemming from ipsative, 
within-individual relationships might 


i 
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be termed "second order" normative 
differences. 

It seems, then, that there are two 
legitimate, but independent realms of 
inquiry for a general psychology of 
individual differences. These are (a) 
the establishment of those normative 
parameters of differences within the 
species that are not merely pheno- 
typical expressions of ipsative internal 
relationships, and (b) the establish- 
ment of ipsative parameters of internal 
organization, As already demon- 
strated empirically, the generalizations 
that hold at one level of analysis can 
differ from those that hold at the 
other level (Broverman, 1960, 1961; 
Mandler, 1959). 


NORMATIVE AND ÍPSATIVE 
MEASUREMENT IN STATISTICS 


Relatively little explicit attention is 
currently given to the distinction be- 
tween normative and ipsative measure- 
ment.? Consequently, numerous in- 
consistencies in the applications of 
these orientations exist. For instance, 
most psychologists seem to prefer 
normative approaches to research 
Problems even though many, if not 
most, of present day psychological 
theories lean heavily on ipsative 
conceptualizations. The development 
of statistical procedures for separating 
these different aspects of behavioral 
data has also been uneven. Some 
Statistical techniques discriminate be- 
tween normative and ipsative aspects 
of data while others do not. The fol- 
lowing two hypothetical examples, in- 
volving typical research problems and 
Statistical procedures, illustrate (a) an 
appropriate separation of normative 
from ipsative aspects of data, and (b) 
à confounding of these two types of 
information. 


2 An outstanding exception is Cattell's 


(1944) paper concerning the use of norma- 


tive and ipsative scores in psychology. 


The first example involves assessing 
the significance of the difference be- 
tween two means. An investigator 
wishes to determine whether motivat- 
ing instructions improve performances 
on a particular task. Hence, the ex- 
perimenter administers the task under 
a motivated and a nonmotivated condi- 
tion to the same group of subjects. 
Assuming sequence effects are con- 
trolled, the experimenter might pro- 
ceed to assess the significance of the 
difference between condition means via 
an analysis of variance technique. 
Since the experimental design employs 
repeated measurements, the total vari- 
ance would first be partitioned into 
"between" . and “within” individual 
sources of variation. The within 
individual variance would then be fur- 
ther divided into that associated with 
variation in conditions, and that which 
is not, After division by the appropri- 
ate degrees of freedom, the ratio of the 
two portions of within-individual vari- 
ance would constitute an F test. If 
this test attains the required level of 
significance the experimenter is able to 
say that “The mean performance under 
the motivated condition is significantly 
different from the mean performance 
under the nonmotivated condition.” 

This statistical procedure clearly 
separates the normative from the 
ipsative aspects of the data. The 
between-individual variance _ reflects 
normative interindividual differences 
in task ability, independent of condi- 
tions, and is appropriately discarded 
since this variance is not the point of 
inquiry. The experimenter is not con- 
cerned with ability differences between 
subjects. He wishes only to know the 
effects of conditions on the subjects 
after such ability differences are con- 
trolled. The within-individual vari- 
ance, which reflects ipsative or intra- 
individual variation, on the other hand, 
reveals that the experimental conditions 
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had significant effects upon perform- 
ance when these effects were measured 
against each subject's own norm. This 
statistical procedure, then, sensitively 
discriminates between normative and 
ipsative aspects of data. 

In contrast, correlational analyses 
typically give little attention to the 
normative-ipsative distinctions in be- 
havior. Consequently, correlational 
procedures tend to confound inter- 
individual with intraindividual varia- 
tions in performance scores as illus- 
trated in the following example: 

The experimenter wishes to know 
the correlations between several tasks. 
Accordingly, he administers each task 
to each subject in his subject sample 
and correlates the various Scores, 

Table 1 presents hypothetical stand- 
ardized scores for 12 subjects on four 
tasks. Table 2 shows that the Pearson 
product-moment — intercorrelations of 
these tasks are positive, ranging from 
.69 to .93. This standard correlational 
procedure accepts these correlations as 
the relationships between test perform- 
ances, which, indeed, they are. How- 
ever, in the light of the previous dis- 
cussion, we know recognize that these 
relationships between test perform- 


TABLE 1 


HYPOTHETICAL PERFORMANCE Data 
STANDARDIZED Scores 


Mean 
Test 4 | standard 
score 


Subject | Test 1 | Test 2 


67 | 62 | 58 | 55 | 60.5 
62 67 55 58 60.5 
58 | 55 | 62 | 67 | 60.5 
PI 

51 49 4 . 
35 50.0 
49 | 47 | 51 | 53 | 50.0 
47 49 53 51 50.0 
38 | 33 | 39.5 
42 45 33 38 39.5 
38 33 42 45 | 39.5 
33 | 38 | 45 | 42 | 39.5 
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TABLE 2 
INTERCORRELATION OF TESTS 


Test Test 2 Test 3 Test 4 
Test 1 93 69 69 
Test 2 69 69 
Test 3 93 
Test 4 


Note.—Decimals have been omitted. 


ances are a function of two sources of 
variation: (a) normative differences 
between individuals in general ability 
levels, and (b) within-individual ipsa- 
tive variations in abilities on different 
tasks. The correlation coefficients in 
Table 2; then, are confounded result- 
ants of these two sources of variation. 
This is the same confounding that the 
experimenter took pains to avoid when 
assessing the difference between condi- 
tions means. The analogue of the 
mean difference technique in correla- 
tional procedures would be to eliminate 
normative differences in general ability 
level between individuals before exam- 
ining the intraindividual variations that 
occur as a function of differences in 
task requirements. The more precise 
experimental question would then be 
“What are the relationships between 
abilities on different tasks when dif- 
ferences in general ability levels be- 
tween subjects are held constant of 
statistically controlled ?" As will be 
shown below, these relationships may 
be quite dissimilar to those obtained 
from the usual confounded method. ; 

Our problem is to separate the vari- 
ance attributable to normative differ- 
ences between subjects from that due 
to within-individual variations in ability 
to perform different tasks. 

Normative variance between sub- 
jects occurs when some subjects tend 
to be generally above the group mean, 
while others tend to be below this refer- 
ence norm. However, if the task per- 
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TABLE 3 


HYPOTHETICAL PERFORMANCE DATA TRANS- 
FORMED INTO [PSATIVE SCORES 


(X; — XQ 

subject Test 1 Test 2 Test 3 Test 4 
1 6.5 1.5 | —2.5 | —5.5 
2 1.5 65 | —55 | —2.5 
3 —2.5 | —5.5 1.5 6.5 
4 —5.5 | —2.5 6.5 1.5 
5 3.0 10 | —1.0 | —3.0 
6 1.0 30 | —30 | —1.0 
7 —10 | —3.0 1.0 3.0 
8 —30 | —1.0 3.0 1.0 
9 5.5 2.5 | —1.5 | —6.5 
10 2.5 5.5 | —65 | —1.5 
11 -1.5 | —6.5 2.5 5.5 
12 —6.5 | —1.5 5.5 2.5 


formances of each subject are ex- 
pum as ipsative scores, or deviations 
m Ne subject's mean level of per- 

nce, then all subjects will have 
the same status with respect to the 
new reference norms, i.e., the sum of 
the deviations above each ipsative norm 
Will equal the sum of the deviations 
below for each subject. Hence, norma- 
tive differences between subjects are, 
in effect, removed via ipsatization of 
performance scores. Correlation of the 
Ipsative scores will then reveal the in- 
trapsychic relationships between abili- 
ties on different tasks, free from the 
Influence of differences in general 
ability between subjects. 

The mean standard score of each 
Subject is shown in Table 1. Table 3 
Shows the ipsative scores (X; — Xi) of 
Sach subject on each task. Table 4 
Shows the intercorrelations of these 
Ipsative scores, It will be noticed that 
Some of the correlations are now nega- 
live, others positive. Thus, controlling 
Normative interindividual differences in 
Seneral ability levels considerably alters 
the correlational relationships between 

asks, and, of course, any conclusions 
that might be drawn from these data. 


Instances where such controls have 
actually affected the results of studies 
have been reported by Mandler (1959) 
and Broverman (1960, 1961). 

It seems clear then, that correla- 
tional analysis, a classical research tool 
in systematic psychology, has suffered 
from a lack of sensitivity to the 
normative-ipsative distinctions in be- 
havior. Standard R factor analysis 
technique, which is an elaboration of 
the correlational method, suffers in the 
same manner, i.e., the factors extracted 
by these techniques are confounded ex- 
pressions of normative and ipsative 
variation. The confounding in R factor 
analysis can be overcome in the same 
manner as illustrated above, ie, by 
ipsatizing the various performance 
scores before starting correlational and 
factoring procedures. As in the above 
correlational example, factors derived 
from ipsative scores will frequently 
have large negative loadings or correla- 
tions with tests. A discussion of the 
numerical mechanics underlying this 
phenomenon may be found in Brover- 
man (1961). That paper also pointed 
out that an ability factor defined by 
both positive and negative loadings can 
be viewed as a “choice point” in de- 
velopment since at some point in each 
individual's development a decision 
must be made as to whether he is to 
specialize in one or the other of the two 
classes of behaviors defining the factor. 
Other investigators (Cattell & Bag- 
galey, 1956; Sweney, 1959) have inter- 
preted this specialization or directional 


TABLE 4 
INTERCORRELATIONS OF IPSATIVE SCORES 


Test Test 2 Test 3 Test 4 
Test 1 58 -79 —79 
Test 2 =19 —79 
Test 3 58 


Note.—Decimals have been omitted. 
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aspect of ipsative scores as an index 
of motive strength. 


RELATIONSHIP BETWEEN NORMATIVE 
PARAMETERS AND PARAMETERS Dr- 
RIVED FROM DOUBLE-CENTERED 
IPsATIVE SCORES 


The relationship between dimensions 
derived from ipsative scores distributed 
about a normative base-line, and the 
normative dimension itself is worthy of 
particular note. Such ipsative dimen- 
sions are internal subdivisions or dif- 
ferentiations of the larger normative 
dimension. Again, Spearman's (1927) 
theory of intelligence illustrates this 
point. Spearman argues that intel- 
ligence varies in units of mental energy. 
Normatively, then, the subjects differ 
from each other in terms of the amount 
of mental energy possessed by each. 
However, within intelligence, as men- 
tioned earlier, Spearman posits sub- 
functions of eduction, reproduction, 
etc, the strength of which are also 
measured in terms of mental energy 
and which vary ipsatively within each 
individual. These subfunctions, then, 
are internal differentiations within the 
larger normative trait of intelligence. 

This same part-whole relationship 
between ipsative and normative factors 
is obtained when the ipsative dimen- 
sions are empirically derived from 
double-centered ipsative scores via fac- 
tor analysis. For instance, a subject's 
average rank performance for a battery 
of cognitive tests might be interpreted, 
normatively, as an indication of his 
general level of intelligence. „On the 
other hand, the factors emerging from 
an analysis of the ipsative scores dis- 
tributed about the average rank stand- 
ings of subjects would represent in- 
ternal differentiations of that normative 
A idw part-whole relationship of ipsa- 
tive to normative factors may we con- 
trasted with the relationships between 


' other, 
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factors produced by the usual norma- 
tive R factor analysis procedures. 
Factors produced by this latter method 
are discrete, and juxtaposed externally 
to each other. Even though normative 
R factors may be correlated with each 
one normative factor is not 
thought of as a subdivision of another, 
as in the double-centered ipsative ap- 
proach, 

As pointed out by Stephenson 
(1953), factor analysis is typically used 
in an exploratory way to determine the 
various  interdependencies between 
psychological functions or behaviors. 
When statistical procedures are used 
in this exploratory manner, one tends 
to allow the model embodied in the 
statistical procedures to guide one's 
conception of the organization of the 
behavior reflected in the data. Thus, 
the 30 years of:research employing 
normative R factor analysis, with its 
model of externally juxtaposed factors, 
has left its mark on present conceptions 
of mental functioning, eg. Burt’s 
The Factors of the Mind (1946), 
Thurstone’s Primary Mental Abilities 
(1938). While the part-whole model 
embodied in double-centered ipsative 
measurement is not new on the pIa 
chological scene—witness Spearman $ 
(1927) theory of intelligence—a clearer 
recognition of its properties may exert 
a correspondingly greater influence 
upon our conceptions of the organiza- 
tion of behavior, 


DETERMINATION OF APPROPRIATE 
Normative Basr-LiNES 


A major problem in double-centered 
ipsative measurement is the identifica- 
tion of appropriate normative base- 
lines from which to develop ipsative 
Scores, So far, we have assumed that 
a single normative base-line may be 
used for a given matrix of data. This 
assumption is implicit in the analysis 
of variance repeated measurements de- 
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Sign. However, it is possible that 
more than one normative variable, 
hence, more than one base-line, is 
actually embedded in a given matrix 
of data. In this case, certain behaviors 
ought to be ipsatively measured against 
one base-line, others against another. 
Hence, a method for objectively estab- 
lishing the presence of normative vari- 
ables is needed. This problem may be 
resolved via the following considera- 
tions: 

Any single performance score may 
be thought of as a resultant of norma- 
tive and ipsative sources of variation. 
Consider, now, a situation where sev- 
eral tests all reflect a single normative 
trait, but some of these tests also reflect 
One ipsative dimension, other of these 
tests reflect a second ipsative dimen- 
Sion, still others reflect a third ipsative 
dimension, etc. Each test score of a 
given, subject, then, consists of a con- 
Stant representing his status on the 
normative dimension and a variable 
representing the standing of the task 
On a particular ipsative dimension. 
Since this normative constant varies 
between subjects, it tends to produce 
Positive relationships between tests. 
The ipsative components of the various 
performance scores, on the other hand, 
Will either correlate positively when 
they reflect the same ipsative dimen- 
Sion, or not correlate when they reflect 
different ipsative dimensions. Still 
other ipsative components may be in- 
Versely related and work towards nega- 
tive relationships. Thus the normative 
Components tending to produce posi- 
tive relationships between tests will 
Summate with those positive ipsative 
Covariations, but will be weakened or 
Canceled out by those independent and 
Negatively varying ipsative components. 

he first order (simple structure) 
factors extracted from performance 
Scores, then, will be those tests whose 
Ipsative components have positive rela- 


tionships augmented by their positively 
covarying normative components. 

Since each of these factors have the 
same normative components, only the 
variation of the ipsative components 
can be acting to produce the different 
factors. Also, since each factor has 
the same normative component, these 
factors should correlate with each 
other. Accordingly, the factor inter- 
correlations ought to produce a second- 
order factor reflecting the presence of 
the normative trait. Of course, if the 
data actually contained more than one 
normative trait, then more than one 
second-order factor would be obtained. 
The number of second-order factors 
obtained from a factorial solution of 
performance data, then, indicates the 
number of normative dimensions pres- 
ent in the data. In turn, each norma- 
tive dimension calls for its own base- 
line from which to assess ipsative 
variations of the tests included in the 
second-order factor. Thus, the mean 
group standing of all tests included in 
each second-order factor would be ob- 
tained for each subject. Deviations of 
individual tests from these means 
would then reflect ipsative variations 
within a given normative dimension. 
Other methods of obtaining appropri- 
ate base-lines for ipsative oriented . 
research problems may be found in 
Broverman (1960). 


CONCLUSION 


It may be well to conclude by at- 
tempting to see the issues discussed in 
this paper in a broader perspective. It 
seems that the measurement of be- 
havior is more complicated than pre- 
viously suspected. One wonders why 
this should be so; why one can not 
proceed to a more straightforward 
measurement of behavior. The reasons 
seem at least three-fold: 

First, human behavior is so bewil- 
deringly varied and multifaceted that 
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few psychologists are willing to stay at 
the level of raw, uncategorized, ob- 
servations of behavior. Such an en- 
terprise would produce an infinity 
of parameters, variables, and scales. 
Rather, one typically seeks to reduce 
a large number of different behaviors 
to a smaller number of common under- 
lying parameters. The pertinent vari- 
ables of behavior, then, are not acces- 
sible for direct measurement. One 
must fractionate, peel off, and discard 
the surface aspects of different be- 
haviors in order to uncover and obtain 
an estimate of the underlying variables. 
Thus, in measuring behavior, pheno- 
typical surface appearances are dis- 
carded in favor of inferred genotypical 
variables. 

A second aspect of human behavior 
which contributes to the complexity of 
its measurement is that any given be- 
havior can Simultaneously represent 
more than one underlying variable. A 
given behavioral act may be a com- 
posite expression of many different 
variables. "Thus, one must Separate 
not only phenotype from genotype, but 
also genotype from genotype. 

Finally, we must recognize that all 
human behavior potentially participates 
in two different realms of human varia- 
tion, i.e., species and individual. Each 
behavioral act is, potentially at least, a 
simultaneous expression of the human 
species' as well as of the individual's 
repertoire of variation. It is this dual 
aspect of behavior which produces the 
particular normative-ipsative — model 
elaborated in this paper. Failure to 
represent this fact in measurement and 
statistical operations must lead to a 
confounding of these two different 
aspects of human existence and varia- 
tion. If one accepts the Proposition 

that the laws or regularities „which 
characterize variations in behavior be- 
tween individuals are not the ie. as 
those which characterize variations in 
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behavior within individuals, then such 
confounding is to be studiously avoided. 
Hence, an orderly inquiry into the 
nature of human behavior requires a 
stringent separation of the between- 
individual species parameters from the 
ipsative intraindividual organizational 
aspects of behavior in order that the 
Separate relational networks of each 
realm may be established. While this 
may involve considerable more work 
than previous methods, the results may 
be proportionately more rewarding. 
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FRUSTRATIVE NONREWARD IN PARTIAL 
REINFORCEMENT AND DISCRIMINATION LEARNING: 


SOME RECENT HISTORY AND A THEORETICAL EXTENSION! 
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University of Toronto 


The overall objective of this paper 
is to examine the current status of a 
concept of frustrative nonreward in 
behavior theory. The examination 
takes two forms: a general discussion 
and survey of recent work on mo- 
tivating and inhibiting properties of 
nonreward, followed by a theoretical 
extension of  frustrative-nonreward 
theory to certain prediscrimination 
phenomena. 

In the first part of the paper a brief 
historical account will be presented of 
recent changes toward an "active" 
conceptualization of nonreward, par- 
ticularly in neo-Hullian theory, the cur- 
rent status of such an active nonreward 
concept in explanations of partial re- 
inforcement effects will be set in a 
broader framework of partial rein- 
forcement theories, and some recent 
data will be presented in support of the 
idea that discrimination learning in- 
volves frustrative-nonreward effects, 

The second more theoretical portion 
of this paper stems from an earlier 
frustrative conceptualization of non- 
reward (Amsel, 1958). It represents 
an attempt to extend frustrative-non- 
reward theory to situations in which 
discrimination learning is preceded by 
various amounts and kinds of "prior 
experience." These prior experiences 


1A shorter version of obe e d 

nted to the Psychology Colloquium of the 
University of Illinois in May 1961. The 
preparation of this paper and much of the 
experimental research to which it refers 
were supported by grants (G-5527 and 
G-13895) from the National Science Founda- 


tion. 


will be termed “prediscrimination 
treatments” and will involve training 
to approach in the presence of one or 
both (separately) of the eventual 
discriminada, under conditions of 
partial or of continuous reinforcement. 


History 


Hull’s adoption of what he termed 
the Mowrer-Miller hypothesis as à 
basis for a two-factor theory of inhibi- 
tion (Hull, 1943) was, at the same 
time, the acceptance of a “passive” con- 
ception of nonreward and rejection 
of a more active conception which 
he and others had held. Certainly, 
Pavlov's nonreinforcement, prodücing 
internal inhibition, was an active con- 
ception. Spence’s (1936, 1937) theory 
of discrimination learning, clearly in 
the Pavlovian tradition, also assigne 
special inhibitory properties to non- 
reinforcement. In some of his more 
recent systematic writing, Spence 
(1956) points out that his position on 
the Mowrer-Miller hypothesis was 
never in agreement with Hull's. 

Emotional-motivational effects O 
nonreward, as they relate to learning» 
were described in an article by Miller 
and Stevenson (1956), which attrib- 
uted the agitated behavior of rats dur- 
ing extinction to effects of nonrein- 
forcement carried over from one tri 
to the next. This is an example 9 
an active motivational conception ° 
nonreward. A similar emotional-mott- 
vational interpretation of nonrewar' 
can be found in Skinner (1938). a 

In Hull's 1943 theory, which dom! 
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FRUSTRATIVE NONREWARD 


nated the psychology of learning in the 
forties, nonreinforcement was an event 
without direct inhibitory or motiva- 
tional effect as it was also in the 
early mathematical-model treatments 
of learning (Bush & Mosteller, 1951; 
Estes, 1950). By about 1950, how- 
ever, expressions of dissatisfaction 
with the passive theoretical status of 
nonreward were emerging. These ex- 
pressions were in theoretically oriented 
experiments and in theoretical papers. 
Most of the early defections from 1943 
Hull, in this connection, were by per- 
sons close to the Hullian point of view. 
There are suggestions that Hull, him- 
self, was getting ready to change his 
Position on this shortly before his 
death (Hull, 1952, Ch. 5). 

, Rohrer's (1949) treatment of ex- 
ünction in terms of a "frustration 
drive" was one of the first expressions 
of a desire to give nonreinforcement a 
more active role in inhibition? How- 
ever, while Rohrer's reference to frus- 
tration was specifically related to the 
Operation of nonreinforcement, it pro- 
vided no particular mechanism for 
frustrative inhibition. At about the 
Same time, the well-known experi- 
ments of Virginia Sheffield (1949, 
1950) appeared, relating partial re- 
inforcement acquisition and extinction 
effects to the spacing of trials and 
Containing suggestions of both frustra- 
tion drive and frustrative inhibition. 
These articles and another by Stanley 
(1952), along with a late experimental 
Paper by Hull and others (Hull, 
Livingston, Rouse, & Barker, 1951) 


2 It is questionable that “inhibition” is the. 


appropriate and correct term to use in re- 
gard to response decrement related to frus- 
trative ncnreward. Better terms would be 
interference" or "competition," since frus- 
trative inhibition cannot be regarded as di- 
Tectly affecting the excitatory tendency out 
of which it grows. It is rather a new ex- 
Citatory tendency which competes with the 
er one, 
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and Hull’s (1952) last book, all con- 
tained at least the germ of a change 
toward a more active conception of 
nonreinforcement. An experimental 
paper by Denny and Dunham (1951) 
offered inhibition based on nonrein- 
forcement (frustration) as an alterna- 
tive to reactive inhibition to account 
for differential nonreinforcement ef- 
fects in a T maze. 

Brown and Farber (1951) outlined 
a theory in which nonreward was one 
of several antecedents to frustration. 
In this paper, the major emphasis was 
on the definition of frustration in terms 
of competing response tendencies and 
the authors were concerned only with 
the motivational (drive) properties of 
frustration, and not with inhibitory 
effects. In the same year, a paper by 
Amsel (1951) made the point that 
fractional anticipatory frustration (rr), 
which is the classically conditioned 
form of the frustrative reaction to non- 
reward, provides a mechanism for con- 
ceptualizing the active properties of 
nonreinforcement and should be re- 
garded as a determiner of inhibitory 
effects. And, at about the same time, 
Seward (1951) published a theoretical 
paper which mentioned, without elabo- 
ration, the possible functional proper- 
ties of an anticipatory frustration 
factor. 

The remainder of the decade, 1950 
to 1960, has seen a number of addi- 
tional experimental and theoretical at- 
tacks on the problem of nonreward. 
On the one hand there are experiments 
and theoretical treatments by investi- 
gators in the Hullian tradition of be- 
havior theory which have sought to 
extend and revise Hull’s theory in re- 
gard to the status of nonreward. The 
other hand holds a number of published 
studies representing a variety of theo- 
retical and nontheoretical interests 
which have contributed information 
about frustrative nonreward. 
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Much of this work has been re- 
viewed earlier (Amsel, 1958; Lawson 
& Marx, 1958) ; but some of it is very 
recent. The experiments fall into 
three categories: (a) demonstrations 
of the motivating effects of frustrative 
nonreward, i.e., the carried-over, en- 
hancing effect of nonreward on the 
vigor of immediately following be- 
havior. This has been termed the 
Frustration Effect (FE). Several 
studies, by far the greatest number, 
have been directed not only at demon- 
strating the effect but also at examin- 
ing the variables determining its occur- 
rence and strength (Amsel, Ernhart, & 
Galbrecht, 1961; Amsel & Hancock, 
1957; Amsel & Penick, 1962; Amsel 
& Roussel, 1952; Bower & Stocks, 
1960; Longstreth, 1960; Marzocco, 
1951; Penney, 1960; Roussel, 1952; 
Seward, Pereboom, Butler, & Jones, 
1957; Wagner, 1959). (b) The sec- 
ond category consists of experiments 
which are mainly concerned with the 
frustration drive stimulus, but which 
also touch on frustration reduction as 
reinforcement (Amsel & Prouty, 1959; 
Amsel & Ward, 1954; Tyler, Marx, 
& Collier, 1959). (c) And finally, 
there are a few studies which deal with 
the inhibitory properties of frustrative 
nonreward. The inhibitory mecha- 
nism is fractional anticipatory frustra- 
tion (rp), a classically conditioned 
form of the goal response to frustrative 
nonreward. The mechanism and its 
theoretical properties have been identi- 
fied in detail (Amsel, 1958; Spence, 
1960), and experiments testing impli- 
cations of such a mechanism are avail- 
able (Bower, 1961; Goodrich, 1959; 
Haggard, 1959; Wagner, 1961a, 
1961b). Of particular importance in 
this connection are a series of experi- 
ments by Wagner (1961b) which go 
far toward establishing the proposition 
that frustrative nonreward has many 
of the properties of punishment, and 
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that rp operates in many respects like 
fear. These experiments would support 
such statements as (a) the greater the 
strength of frustration the faster the 
extinction (avoidance) of a nonre- 
warded response; (b) cues paired with 
frustrative nonreward acquire motiva- 
tional properties; and (c) the cessation 
of cues previously paired with frustra- 
tive nonreward serves to reinforce a 
new response. 

There are several other experiments 
on nonreward (or reward prevention) 
as a variable in learning which provide 
support for nonreward as a factor in 
inhibition. For example, studies by 
Lambert and Solomon (1952) and by 
Adelman and Rosenbaum (1954) 
related extinction to the blocking of 
instrumental behavior at various dis- 
tances from the goal. An experiment 
by Adelman and Maatsch (1955) 
found that resistance to extinction de- 
pends to some extent on what the sub- 
ject is allowed to do when it finds no 
reward in the goal box, ie., is frus- 
trated. The suggestion is that resist- 
ance to extinction is high when the 
subject makes a response to frustration 
Which removes it from the frustrating 
situation rather than one which keeps 
it in the apparatus. Ferster (1937, 
1958) found that stimuli signaling 
“time out” from positive reinforce- 
ment acquire aversive properties. He 
also showed that the withdrawal of 2 
positive conditioned reinforcer had the 
functional properties of punishment, 
suppressing the rate of responding 
much like the presence of a negative 
reinforcer. These experiments point 
clearly to the inhibitory (aversive 
properties of nonreward. cH 

Some recent experiments, whic 
show decreasing resistance to extint 
tion with increasing numbers 2% 
Magnitudes of reinforcement, must. 
Tegarded as supporting a conceptio" 
of nonreward as an active factor. 
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example, Hulse (1958) and Armus 
(1959) found that larger magnitudes 
of reinforcement in acquisition are 
followed by faster extinction. Very 
recent reports by Reynolds and Siegel 
(1961), by Pavlik (1961), and by 
Wagner (1961a) were to the same ef- 
fect. North and Stimmel (1960) have 
shown that rats reinforced 90 or 135 
times extinguish faster than those re- 
inforced 45 times. Similar results 
have been reported by Reynolds, 
Richter, and Carlock (1960). They 
found faster extinction following .5- 
than following .l-gram reward, and 
faster extinction after 66 acquisition 
trials than after 6, 12, or 30 trials. 
This kind of result is quite in accord 
With results from studies of overlearn- 
Ing and discrimination reversal, which 
we will look at later. Together, they 
are as incompatible with early Hullian 
theory, or any other theory requiring 
Monotonicity and positive relationships 
in these variables (e.g., the various 
mathematical approaches), as were the 
carly partial reinforcement findings. 

€ will come back to some of these 
experiments later, and my discussion 
Of partial reinforcement effects will 
show why a theory which includes a 
frustrative nonreward hypothesis must 
Predict such results. 

There are by now several experi- 
Ments which can be taken to indicate 
that nonreinforcement is more im- 
Portant than reinforcement in discrimi- 
Nation. Chronologically, these are 
Studies by Fitzwater (1952). Grove 
and Eninger (1952), Eninger (1953), 
Shoemaker (1953), Cantor and Spiker 
(1954), Grice and Goldman (1955), 
Birch (1955), and Solomon (1956), 
O mention some of the earlier ones. 

Very recent study which makes this 
Same point has been reported by 
Lachman (1961). This is not the 
Place for 4 detailed description of these 
Studies ; however, the burden of evi- 


dence is that avoidance of nonreward 
is a more powerful factor in discrimi- 
nation than approach to reward; and 
this is, of course, very relevant to a 
point of view which makes nonreward 
a determiner of inhibition. Perhaps 
Harlow and Hicks (1957) were re- 
acting not only to their own experi- 
mental findings but also to the weight 
of this kind of evidence when they 
argued for a "uniprocess" rather than 
a "duoprocess" theory of discrimina- 
tion learning, the single process being 
inhibition. The weakness of their 
argument is that the negative process 
(rp, in my terms) may well depend on 
the prior existence of a positive process 
(see Amsel, 1958; Spence, 1936, 
1937), and one is back to two proc- 
esses again. 

Other examples of a return to an 
active conception of nonreward outside 
of the Hull-Spence camp can be found 
in the recent work of Mowrer and 
Estes. Mowrer, in his Learning 
Theory and Behavior, makes frequent 
use of the notion of nongoal events 
arousing "frustration" or "anger," in 
addition to "disappointment" and ex- 
presses agreement with the position I 
favor in regard to frustrative goal 
events (Mowrer, 1960, p. 409). 
Estes (1959) has recently acknowl- 
edged that statistical learning theory 
must cope with the problem of an 
active negative process based on non- 
reward in these words: 


... I have found that in the naive animal 
an unreinforced trial produces no apparent 
change in response probabilities, but that 
after a series of reinforced runs to a given 
side of a T maze, the introduction of non- 
reinforcement yields a decrement in proba- 
bility of response to that side. . . . Appar- 
ently under partial reinforcement schedules, 
the effect of nonreinforcement varies be- 
tween these extremes . . ." (p. 417). 


From this brief sketch of the recent 
history of the concept of nonreward, 
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we move to a description of the role 
of active conceptions of nonreward 
in interpretations of noncontinuous 
(partial) reinforcement effects. 


NONREWARD IN INTERPRETATIONS OF 
PARTIAL REINFORCEMENT 


In his 1958 chapter on learning, 
Lawrence differentiates between “in- 
tertrial” and “intratrial” interpreta- 
tions of the Partial Reinforcement 
Effect (PRE)—the finding that partial 
or intermittent reinforcement in learn- 
ing leads to greater resistance to 
extinction than does continuous rein- 
forcement. Intertrial explanations at- 
tribute the PRE in one way or another 
to immediately carried-over traces of 
stimulation from one trial to the next 
in acquisition. According to such inter- 
pretations, it is on the basis of such 
traces that acquisition is more clearly 
discriminated from extinction by con- 
tinuously than by partially reinforced 
subjects. Intratrial explanations ascribe 
the PRE to some kind of learned 
mechanism developing on each trial in 
acquisition and having its effect, as- 
sociatively, later in extinction. There 
have been two classes of intertrial ex- 
planations. In both cases the classes 
can be termed Cognitive-Expectancy 
(C-E) interpretations, and Stimulus- 
Response (S-R) interpretations. 

Generally speaking, the intertrial 
interpretations were an earlier develop- 
ment than the intratrial interpretations. 
Early C-E interpretations were the 
“common sense" expectancy hypothe- 
sis which emerged out of the early 
important experiments of Humphreys 
(e.g., 1939, 1940) and the discrimina- 
tion hypothesis developed and tested 
by Bitterman and his associates (e.g., 
Bitterman, Fedderson, & Tyler, 1953). 
On the S-R side, the major intertrial 
explanation (apart from the ep es 
unit" interpretation which applies to 
chained or highly massed responses 
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but not to discrete trial learning) was 
the Hull-Sheffield hypothesis tested by 
Sheffield (1949). Weinstock’s (1954) 
demonstration that partial reinforce- 
ment effects were not eliminated even 
when trials were separated by 24 
hours, and an experiment by Wilson, 
Weiss, and Amsel (1955) disconfirm- 
ing the earlier results of Sheffield 
helped to set the stage for, and indeed 
to provide, intratrial explanations of, 
the PRE. Much later, in fact very re- 
cently, there has been strong contra- 
indication for a discrimination hy- 
pothesis in the work of Jenkins 
(1961b) and of Theios 1962). 
Jenkins, using pigeons in a Skinner- 
type situation, and Theios, using rats 
in a runway, have found that the PRE 
is sustained when a large number of 
continuous reinforcements are inter- 
polated between discrete trial partial 
reinforcement and extinction. This 
means that a large number of continu- 
ous reinforcements immediately be- 
fore extinction and following partia 
reinforcement does not neutralize the 
partial reinforcement effect. Since the 
basis for discriminating acquisition 
from extinction is now the same for 
partially as for continuously reinforce 
subjects, and the PRE persists, 2^ 
interpretation is required, as Theios 
indicates, in terms of “relatively per- 
manent effects of nonreinforcement 
which can be sustained through blocks 
of continuous reinforcement.” 
Intratrial explanations of the PRE 
concern themselves in one way or am 
other with processes following n0™ 
reward. Until recently, the intratri4 
explanations were the exclusive prop 
erty of S-R psychologists. Wet 
stock's (1954) contiguity-interference 
hypothesis was perhaps the earliest © 
these, but was followed in short orde" 
by a variety of neo-Hullian hypothe 
ses. All of these are what I would 
term conditioning-expectancy hypothe 
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Ses as opposed to cognitive-expectancy 
hypotheses of the PRE. All depend 
upon constructs of the anticipatory 
goal response type, representing re- 
sponses which develop in strength 
within trials. There are three neo- 
Hullian interpretations of the PRE, all 
of which were first proposed to account 
for particular experimental findings. 
The first emerged out of a study by 
Wilson, Weiss, and Amsel (1955) and 
was subsequently elaborated and tested 
In some detail (Amsel, 1958). It at- 
tributes the PRE to the evocation of 
frustration by nonreward in partial 
reinforcement, to the conditioning of 
anticipatory frustration (rp — sp) to 
cues in the instrumental sequence, and 
to the association of sp with approach 
Tesponses in acquisition. Logan’s in- 
terpretation of the PRE (Logan, 
Beier, & Kincaid, 1956) is that it 
depends on the extinction of rg 
to — "postreinforcement-time ^ cues." 
Kendler and his associates (Kendler, 
Pliskoff, D'Amato, & Katz, 1957) 
have offered an interpretation similar 
to the one I have favored, except they 
Choose a neutral designation of non- 
Teward effects in acquisition and refer 
to anticipatory nonreward and not 
to anticipatory frustration. Spence 
(1960) has recently come out in favor 
of the same kind of interpretation and 
has employed the rp construct to ac- 
Count for acquisition. phenomena in 
Partial reinforcement. Consequently, 
it seems safe to say that a prominent 
Interpretation of the PRE in neo- 
ullian writings is a conditioning-ex- 
Pectancy position with identifies frac- 
tional anticipatory frustration as well 
as fractional anticipatory reward. 
Recently, Festinger (1961) has 
Called attention to the importance of 
“insufficient rewards" in a cognitive- 
€xpectancy (cognitive dissonance) in- 
terpretation of the PRE. Cognitive 
'ssonance (Festinger, 1957) is a hy- 


pothesis which recognizes the active 
properties of nonreward. In its use of 
delay of reward procedures to enhance 
attractiveness of a goal in a delay of 
reward situation, Festinger’s treatment 
of insufficient rewards is very similar 
in conceptualization to some earlier 
work of Olds (1953, 1956). On the 
basis of experiments with children, 
Olds concluded that the reward value 
of a secondary rewarding stimulus 
(S7) could be increased—and hence the 
strength of approach responses leading 
to S* could be increased—by delaying 
the presentation of S" after overlearn- 
ing. He showed that tokens which 
could ultimately be exchanged for 
more primary reward increased in re- - 
ward value when their presentation 
was delayed, if these delay trials fol- 
lowed a number of trials in which their 
presentation was immediate. Olds 
suggested that such a procedure con- 
stitutes “practice at wanting” and that 
such practice at wanting increases the 
value of that which is wanted (S7). 
Festinger would seem to be taking the 
same position in his animal research. 
In one of several experiments dealing 
with dissonance in rats, he employs a 
double runway in which a start box is 
separated from a mid box by an alley, 
and the mid box is separated from an 
end box by another alley. Subjects 
are delayed in the mid box in the 
manner of Holder, Marx, Holder, and 
Collier (1957) and Wagner (1959), 
and the finding is that the subjects de- 
layed in the mid box, before being al- 
lowed to run to the end box and find 
food, will continue to run to the mid 
box longer on test trials than the sub- 
jects always fed in the mid box as well 
as in the end box. Festinger's (1961) 
interpretation is that the delayed sub- 
jects develop dissonance as a result of 
the delay, and that the “dissonance 
tends to be reduced by developing some 
extra preference about something in 
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the situation. The existence of this 
extra preference leads to the stronger 
inclination to continue running during 
extinction trials" (p. 9). 

'The position of Festinger, like the 
earlier one of Olds, is clearly that non- 
reward (or delay of reward) is an 
active factor determining an increased 
want or attraction. However, there 
seems to be an important difference in 
what increases in attractiveness be- 
tween Festinger's earlier treatments 
of dissonance in humans (Festinger, 
1957) and his application of the con- 
cept to the explanation of the PRE in 
rats (Festinger, 1961). While speci- 
ficity is not always a characteristic of 
cognitive theory, it does seem as 
though the 1957 treatment (and even 
parts of the 1961 paper) did specify 
that the violation of a cognitive expec- 
tation produces dissonance which is 
resolved by upgrading the value of the 
previous goal object. However, in the 
treatment of "insufficient rewards" in 
terms of the prototype experiment with 
start box, mid box, and end box, dis- 
sonance is said to develop in the mid 
box, where the subject never has found 
food; and dissonance is reduced “by 
developing a preference for something 
about the place where the dissonance 


was introduced . . . by developing 
some liking for the place where they 
were not rewarded...” (p. 10). 


This latter position is used to explain, 
in terms of dissonance, why the sub- 
jects never fed in the mid box run 
longer to the mid box on nonreward 
test trials than the subjects always fed 
there before. The same thinking is 
then carried over to explanation of the 
PRE. In terms of the S-R analysis, 
the subjects have been trained to ap- 
proach the mid box of Festinger's 
situation despite anticipatory frustra- 
tion (sr) cues because, in the he 
prior to testing, they were always 
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finally fed in the end box. In C-E 
terms the subjects developed an extra 
preference for the nonreward situation 
—the mid box in which they had never 
been rewarded. This would leave to 
be answered such questions as: Why 
does successive nonreward result in 
extinction? And, how do the subjects 
learn to discriminate cues leading to 
reward from cues leading to non- 
reward ? 

The S-R treatment of nonreward ef- 
fects in partial reinforcement (Amsel, 
1958; Kendler, Pliskoff, D'Amato, & 
Katz, 1957; Spence, 1960; Wilson, 
Weiss, & Amsel, 1955) does not sug- 
gest that nonreward (or delayed 108 
ward) increases "wanting" in Olds 
terms, or "preference about something 
in the situation" in Festinger's. This 
type of neo-Hullian approach is a de- 
Scription of a mechanism operating 
through anticipatory nonreward which 
can lead to either rapid extinction of 
behavior or to increased resistance to 
extinction, and which can either facili- 
tate discrimination learning or retar 
it. I will claim that neither increase 
"wanting" nor "preference"—in fact, 
no "common sense" cognitive inter- 
pretation—can account for all of the 
effects which seems tO 
have. . 

The difference between the two pos! 
tions, in the cognitive language of Olds 
and Festinger, is as follows: While 
the C-E position has nonrewat 
increasing attractiveness of the non” 
reward situation, the S-R positio” 
implies a decrease in attractiveness: 
The partial reinforcement effect !*' 
according to Festinger, due to 1 
creased attractiveness, According t° 
my position it is the result of training 
to persist in behavior despite decrease 
attractiveness. In some other respec? 
the positions are similar: both vue 
nonreward the important determina? 


nonreward 
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of the effect, and both stress the im- 
portance of developing (intratrial) as- 
sociative tendencies during acquisition 
as determinants of ultimate resistance 
to extinction. Neither necessitates an 
assumption that the subject discrimi- 
nates acquisition from extinction (Bit- 
terman, Fedderson, & Tyler, 1953). 

The details of the S-R position 
Which I favor have been presented 
elsewhere (Amsel, 1958) in the form 
of a sequence of hypotheses identified 
with four stages of practice in dis- 
crete trial partial reinforcement or in 
discrimination learning (the first 
three stages are essentially the same 
for both). A fifth hypothesis which 
applies to discrimination learning has 
been added, and this will be important 
for some data which follow. The es- 
Sence of the position is outlined below 
in relation to an apparatus we have 
been using. It was designed to dif- 
ferentiate the immediate motivating 
(energizing) effects of frustration ex- 
emplified by the so-called Frustration 
Effect (FE) from fractional anticipa- 
tory frustration (rg), which represents 
the inhibiting feature of frustration. 

he apparatus is essentially, two run- 
Ways in series, either in a straight line 
(eg. Amsel & Roussel, 1952) or in 
the form of an L (eg. Amsel & 
Hancock, 1957). The apparatus has 
a start box (SB), a first runway 
(Runway 1), a first goal box (Gl), a 
Second runway (Runway 2), and a 
Sécond goal box (G2) in successive 
Arrangement. Reward and nonreward 
are manipulated in Gl, and the ap- 
Paratus permits a separation of the 
frustrated response and changes in 
Tunning speed indicating Tr (measured 
in Runway 1), from the frustration 
Motivated response and changes indi- 
cating the FE (measured in Run- 
Maye?) . 
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The sequence of hypotheses in terms 
of five stages of practice is as follows: 

1. In Stage 1, rn —Sr® is developing 
in Runway 1 with early rewards, and 
nonreward (in Gl) has no particular 
effect, as measured by the FE in 
Runway 2. 

2. After the development of rg — Sr 
for the Runway 1 response, nonre- 
wards elicit frustration, measured by 
the FE in Runway 2. This is Stage 2. 

3. When nonrewards elicit frustra- 
tion, the cues in Runway 1 previously 
evoking rg now also begin to evoke rp. 
In Stage 3 these antedating goal re- 
sponse tendencies are temporarily in 
competition. 

4. Since rp and rp cannot be elicited 
separately by differential cues in par- 
tial reinforcement, as they can in the 
latter stages of discrimination learn- 
ing, and since the temporary conflict 
in partial reward training: is resolved 
in favor of running to the intermit- 
tently rewarding goal box, sy becomes 
associated with the instrumental ap- 
proach response in Stage 4 of par- 
tial reinforcement training, providing 
the mechanism for the partial rein- 
forcement effect. When extinction is 
carried out, partially reinforced sub- 
jects have been trained to respond (ap- 
proach in the presence of antedating, 


3T] employ the term, ra — Sr, in the same 
manner as it is used in an earlier develop- 
ment, and for the same reason: Hoe Eg ed 
is a general term covering all types of ante- 
dating conditioned response. Separately, 
these might be designated ra — Sx, fractional 
anticipatory reward; rr — Sp, fractional an- 
ticipatory punishment, usually termed fear or 
anxiety; and rr— SF, fractional anticipatory 
frustration” (Amsel, 1958, p. 102). The 
meaning of rg — Sr is, however, not different 
from ra — se as Spence (1960) uses it in a 
similar connection. All that is intended here 
is a clarification of terminology in light of 
the fact that goal events may be positive 
(such as to elicit approach) or negative 
(such as to elicit avoidance). 
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frustration produced stimuli), whereas 
consistently reinforced subjects have 
not. In discrimination situations there 
is a basis for the differential elicitation 
of approach and avoidance by sp and 
sp respectively in Stage 4. 

5. Following discrimination, when 
rp consistently precedes nonreward, 
the FE following nonreward should 
grow weaker, since nonreward is de- 
fined as frustrating only in the presence 
of rp. 

I have been suggesting that in a very 
important area of the analysis of be- 
havior, the explanation of persistence 
of behavior in the face of nonrewards 
attributable to partial reinforcement, 
the role of nonreward as an active 
process has emerged in both cognitive- 
expectancy and in S-R conditioning 
interpretations. It would certainly 
seem to me that this is an area where 
the clearer, better worked out S-R lan- 
guage and conceptualization might pro- 
vide a handle for a more workable 

, expectancy theory. 

There is, in addition, the matter of 
a basic difference between the cognitive 
and S-R interpretations of the PRE: 
the former says nonreward increases 
attractiveness; the latter says non- 
reward decreases attractiveness, Both 
positions can account for the PRE; both 
can account for faster extinction fol- 
lowing large rewards than following 
smaller rewards; both can account for 
faster extinction following many re- 
wards than following fewer rewards. 
However, a cognitive theory such as 
Festinger proposes would have dif- 
ficulty with a finding of Hulse (1958) 
which has recently been confirmed by 
Wagner (1961a). In Hulse's words: 
* .,large as compared with small 
rewards produced greater resistance to 
extinction if partial reinforcement were 
used, but less resistance to extinction 
if continuous reinforcement were used 

, " (p. 56). In the cognitive lan- 
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guage of Festinger this would have to 
mean that the subjects develop "some 
extra preference about something 1n 
the situation" (greater resistance tO 
extinction) when rewards are small 
and continuous; and that they do not 
develop the extra preference, in fact 
quite the opposite, when rewards are 
small and partial. The result of Hulse 
and of Wagner is clearly in line with 
the S-R interpretation in terms of 
anticipatory responses: Extinction 1$ 
faster following continuous rewards of 
large than of small magnitudes because 
of the greater FE in extinction an 
subsequent faster development of rr 
However, in partial reinforcement 
there is opportunity to elicit FE an 
to build rp during acquisition. Under 
these circumstances, the larger magni- 
tude of reward permits the mecha- 
nism for the PRE—the connection 9 
Sp to continued approach—to develop 
more strongly. Consequently extin 
tion should be slower following partia 
rewards of large magnitude than fol- 
lowing partial rewards of small magni- 
tude. f 
I am claiming that the S-R condi- 
tioning analysis of nonreward effects 
has much to recommend it. The ques- 
tion is, How much? To break the 
question down: Does inclusion of the 
concepts rg and rp in the analysis o 
partial reinforcement and discrimina 
tion learning provide us with explana- 
tory power which a more parsimonious 
approach-avoidance analysis (€87 
D'Amato & Jagoda, 1960; Nisse™ 
1950) does not? Does this type de 
analysis of nonreward effects provi K 
predictive power which a cognitiv® 
treatment of nonreward (eg. Hes 
tinger, 1961) does not? der 
My attempts to convince the rea By 
that the answer to these last two que 3 
tions is “yes” will take two forme, 
(a) Some data will be presented whiC 


SE on 
bear on predictions already made 
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the basis of this type of analysis. I will 
claim that data such as these are not 
readily deducible from either a less 
specific cognitive language or from a 
more parsimonious approach-avoidance 
language. (b) I will offer an exten- 
Sion of frustrative nonreward theory 
designed to generate predictions about 
the course of discrimination learning 
following various  prediscrimination 
experiences with partial and continu- 
ous reward. 

Before proceeding to this extension 
of the theory of frustrative nonreward, 
a brief comment on the relation of the 
present position on nonreward to other 
theories of frustration. As I have 
indicated before (Amsel, 1958), “. . . 
we have been interested in certain 
active properties of nonreward follow- 
ing reward and no more than this is 
meant by the term frustration as we 
use it" (p. 103). Nevertheless, the 
thechanism which has been outlined to 
explain the PRE, particularly the con- 
ditioning of rp and the connection of 
Sp to continued approach as a result 
of intermittent reward, is certainly a 
candidate for the learning theory 
counterpart of “frustration tolerance” 
as employed in social psychology and 
Psychopathology. The treatment of 
frustration from the viewpoint of theo- 
ties of personality is beyond the scope 
of this paper; and there is no intent 
here to grapple with the multiple 
Meanings the concept has in those 
contexts, However, it would seem 
Possible to recast portions of the frus- 
tration theory of Rosenzweig (e.g. 
1944) which deal with nonreward 
(failure or delay of “gratification’’) 
Into the present terminology; and the 
treatment of frustration by Rotter 
(1954), as a discrepancy between re- 
Ward expectancy and its occurrence, 
Seems compatible with the mechanisms 
hypothesized here and with the theo- 
retical extension I will propose. 


NONREWARD EFFECTS IN 
DISCRIMINATION LEARNING: 
Some Data 


According to the sequence of five 
hypotheses on the role of nonreward 
and partial reinforcement in discrimi- 
nation, it should be possible to show 
that the FE, measured in Runway 2 
of a double runway, bears certain rela- 
tionships to various stages of discrimi- 
nation learning. More specifically, 
(a) evidence of nonreward related 
frustration, measured in Runway 27 
should precede any evidence that the 
subjects are running faster to S4- and 
slower to S— in Runway 1, and (b) at 
some point following the appearance of 
discriminative behavior in Runway 1, 
a reduction in the magnitude of the FE 
measured in Runway 2 should become 
apparent. 

In an unpublished doctoral disserta- 
tion, Ward (1961) employed the 
double runway to record discrimina- 
tion to positive (e.g. black) and 
negative (e.g. white) cues presented 
singly in Runway 1, and the concur- 
rent development of frustrative effects 
in Runway 2. A single runway condi- 
tion (Runway 1 alone) was used to 
study the same discrimination uncon- 
taminated by Runway 2 performance. 
There were 20 subjects in the double 
runway group and 10 subjects in the 
single runway comparison group. 
They were run six trials a day for 30 
days, and the intertrial interval was 
about 10 minutes. Figure 1 presents 
data for the 20 subjects in the double 
runway. The top panel shows the 
developing FE in Runway 2. The. 
bottom panel shows discrimination 
starting at about Day 7 (42 trials), 
and at a point well after the beginning - 
of FE development on Day 3 (18 
trials). There is an indication that the 
FE almost disappears (perhaps only 
temporarily) at a point where separa- 
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MEAN MEDIAN RUNNING TIME (Yog SEC.) 


DAYS 


Fic. 1. The appearance and diminution 
of the frustration effect in Runway 2 (top 
panel) in sequential relationship to the de- 
velopment of the successive discrimination 
in ad 1 (bottom panel). (From Ward, 
1961) 


tion of the discrimination curves is 
greatest. This occurs at about Day 15 

(90 trials). 

It could be argued the Ward result 
is an artifact: that different subjects 
reach criterion at different times. An- 
ticipating this, I selected the data of 
seven of Ward's subjects which 
reached the discrimination criterion on 
the same block of trials. Figure 2 
presents these data in 2-day blocks 
and shows essentially the same rela- 
tionships as in Figure 1. 

Figure 3 presents data from the 
single runway comparison group and 
shows the Runway 1 discrimination 
unaffected by any subsequent Runway 
2 performance. Even in the absence 
of the second runway there is a tend- 
ency for the N and R curves of the 
discrimination to converge on later 


‘rials (see Berry, Verplanck, & 
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Graham, 1943). These data of Ward, 
which have been duplicated in our 
laboratory as by-products of other 
studies, are taken as evidence for the 
involvement of frustrative effects in 
discrimination in a manner such as our 
analysis suggests. 

In another experiment using the 
same double runway apparatus, Ward 
(1961) studied rate of discrimination 
learning in Runway 1 as related to 
number of prediscrimination rewards. 
In this experiment, which was similar 
to an earlier one of Shoemaker (1953) 
performed in a single runway, the re- 
Sponse of running in Runway 1 and 
entering G, was rewarded 0, 12, or 
48 times prior to learning a Mss 
White discrimination in Runway |. 
An implication of our position is that, 
With initial positive tendencies to the 
two discriminanda equal, rate of dis- 
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Fic. 3. The successive discrimination in 
a group which ran only in Runway 1. (Ex- 
treme N times going up to 3 or 4 seconds 
are not shown, Note the decline of the non- 
Teward curve in the single runway group as 
In the double runway groups.) 


crimination learning should be a posi- 
tive function of the strength of rp to 
the negative stimulus. This meant 
that prediscrimination trials were all 
Tewarded in G, and that Runway 1 was 
black on half of these trials and white 
9n the other half. On discrimination 
trials, Runway 1 was black when G, 
Contained reward and white when G, 
Contained no reward (or vice versa). 

€cond runway performance was meas- 
ured on every discrimination trial fol- 
lowing reward or nonreward in G, to 
Set a measure of the FE, whose 
Strength at different stages could then 
€ related to the developing discrimina- 
tive performance. Table 1 shows the 
Tesults of this experiment for both 
Tunning time and goal entry measures 
for two criteria of learning. A more 
detailed analysis of the data in Ward’s 
dissertation also indicates the appear- 
ance of the FE before discrimination 
and its diminution or disappearance 
following. discrimination. These data 
are in accord with, and go a little 
SYond, previously cited studies which 


317 


stress the importance of nonreward in 
discrimination. 

A recent experiment by Jenkins 
(1961a) demonstrates the importance 
of nonreward effects in discrimination 
learning in another way. Using 
pigeons as subjects, a discrete trial 
procedure, and key pecking as the cri- 
terion response, Jenkins demonstrated 
that the positive stimulus elicits twice 
as many responses to extinction follow- 
ing discrimination training as does the 
same stimulus following an equal num- 
ber of continuous reinforcements. On 
the basis of an ingenious arrangement 
of controls, Jenkins is able to conclude 
that the increased resistance to extinc- 
tion is not due to the fact that the 
positive stimulus is eventually dis- 
criminated from the negative, but 
rather to the occurrence of nonrein- 
forced responses in the process of 
learning the discrimination. 


FnusrRATIVE NONREWARD THEORY 
AND PREDISCRIMINATION EFFECTS 


In the Ward experiment, subjects 
were exposed before a discrimination 


TABLE 1 


Mean Day on WuHicH CRITERION or Dis- 
CRIMINATION REACHED BY GROUPS 
FOLLOWING 0, 12, OR 48 PRE- 
DISCRIMINATION B+ W+ 


TRIALS 
Group 
Condition F 
0 12 48 
Running time 
Easy criterion 8.0 6.5 | 4.5 |3.64* 
Hard criterion | 12.8 | 10.3 | 8.6 | 5.26** 
Goal entry time 
Easy criterion | 7.2 | 4.3 | 4.1 | 3.22* 
Hard criterion | 10.9 8.1 | 6.1 |6.14** 
* .05. 
=b S01 
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reasoning for the intermediate case is 
that both rg and rp are being evoked 
by B at the start of discrimination, but 
sp elicits approach and sp elicits avoid- 
ance. When, after 32 trials, we switch 
from prediscrimination B+ to the 
B--W- discrimination, B+ evokes 
both approach and avoidance tenden- 
cies through sr and sp, and W—, new 
in the discrimination, evokes mainly 
generalized approach (rg) and little 
avoidance (rp). When the shift is 
from B+ to B—W+, there is a strong 
tendency for sp to elicit avoidance in 
relation to the now negative B— (gen- 
eralizing very little to W+) and a 
strong tendency for sp to elicit ap- 
proach to the new, now positive, W+. 
This latter situation is much more 
favorable for discrimination. The pre- 
diction: After an intermediate number 
of partial rewards of an approach 
response to B, B+W— will pro- 
duce slower discrimination than will 
B-W-+. 

After a few B trials, enough to 
produce some rg but little or no FE 
and no rr, there should be no differ- 
ence in the rate of the B--W— and 
B—W+ discriminations, 


Continuous Reward of a Response to 
One Discriminandum Prior to Dis- 
crimination 


We deal here with the underlined 
conditions on the right of Figure 4: 
continuous (B+) prediscrimination 
experience followed by a discrimina- 
tion in which B remains positive 
(B+W-) or becomes negative 
(B—W+). The expectations here 
are most interesting when we compare 
them to the predictions just made 
for prediscrimination partial reward. 
Briefly, with increasing numbers of 
rewards of approach responses to the 
rediscrimination stimulus, B+ should 
evoke increasing rr, and nonreward in 
the subsequent discrimination should be 


increasingly frustrating. Since rr gen- 
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eralizes strongly from B to W, both B 
and W should evoke rp quite strongly 
and almost equally in the discrimina- 
tion. Nevertheless, the FE should be 
somewhat greater when B is made the 
negative stimulus in the discrimination 
than when it stays positive. However; 
in either case (B+—>B+W-— or B+ 
—B-—W+), the flat gradient of gen- 
eralization of positive tendencies leads 
to a prediction which is the same as 
the earlier Ward result: increasing the 
number of continuous prediscrimina- 
tion rewards should produce faster dis- 
crimination. 

In contrast to the center panel of 
Figure 6 in which the relationships at 
the middle and end points are reversed, 
the differences between the two curves 
in the top panel are shown as consist- 
ently in favor of faster discrimination 
for B--—B4-W-—. This is not a pr 
diction based on frustrative nonrewar* 
factors alone. In fact, part of this 15 
not a prediction at all. There is evt 
dence in some unpublished data" that 
a switch from B+ in prediscrimina- 
tion to B— in discrimination does pro" 
duce a greater FE in Runway 2 than 4 
switch from B+ to W—, as expected 
(Amsel & Hancock, 1957) ; however 
an immediate generalization decrement 
of the Runway 1 response in switching 
from B to W weakens response to W? 
which speeds the apparent separatio 
of the B-+W-— discrimination curves 
and retards the separation of the 
B-—W+ curves. The suggestion © 
that, in the case of prediscriminatio”, 
continuous reinforcement, the factor ° 
generalization decrement, operating 
against the somewhat greater FE wher 
the prediscrimination. stimulus . 
comes negative in the discrimination, 
should maintain the relative advanta8 
of B--—B-4-W-— over B+—B— 


5'The reference is to data collected, Pe 
Susan Hicks at Newcomb College, TU ate 
University, as part of an undergrad 
honors project. 
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for all numbers of prediscrimination 
trials. In prediscrimination partial re- 
inforcement (middle panel of Figure 
6), since there is a switch-over in rela- 
tive positions from 32 to 120 trials, the 
factor of generalization decrement 
should reduce the apparent difference 
at the intermediate point and increase 
the reversed difference at the 120-trial 
point, 

While some of the implications of 
the analysis of variables represented 
by the schema in Figure 4 are cur- 
rently being tested using a runway 
apparatus with amplitude changes as 
the response measures, we plan also to 
test these ideas in terms of rate of dis- 
crimination in two-choice situations 
with single presentation of discrimi- 
nanda, In these latter situations, gen- 
cralization decrement effects due to 
introduction of new stimuli would still 
be expected to effect running speed ; 
however, they might have less effect on 
a correct-choice indicant of speed of 
discrimination, and provide an addi- 
tional check on the implications of this 
position. 

. We have been looking at situations 
in which the response to a single 
Stimulus cue is partially or continu- 


. Ously rewarded some number of times, 


and this stimulus then becomes either 
the positive or the negative cue ina 
discrimination. The last two sections 
Of this paper will outline some of the 
factors operating in prediscrimination 
experience involving both discrimi- 
nanda, 


Partial versus Continuous Reward of 
Responses to Both Discriminanda Pre- 
sented Separately Prior to Discrimi- 
nation 


The bottom panel of Figure 6 sum- 
Marizes predictions about the rate of 
discrimination learning following pre- 
discrimination experience with both 
discriminanda presented separately. I 
am suggesting that B}W+>B+W— 


operates essentially like B+7B+W— 
(or B4—B—W-), except that it 
would not be susceptible to generaliza- 
tion effects, since neither B nor W is 
new in the discrimination. There is 
some evidence related to this condition 
from the Shoemaker and Ward experi- 
ments, but only for the part of the 
“function” covering the first two 
points. Analysis of B+W= as a pre- 
discrimination condition indicates a 
decreasing monotonic relationship be- 
tween rate of discrimination and num- 
ber of such prediscrimination trials. 
The reasoning is simply that partial 
reinforcement of approach responses to 
both stimuli will result in conflicting 
approach and avoidance response tend- 
encies to both B and W at the 32-trial 
stage; and the elicitation of approach 
responses to sp in connection with both 
B and W at the 120-trial stage. 
While the former represents a dif- 
ficult base on which to build discrimi- 
nation, as compared with B+W +, the 
latter should be even worse for dis- 
crimination, since it involves the 
mechanism for the PRE (sr connected 
to approach) elicited by the negative 
stimulus in the discrimination. Conse- 
quently, a prolonged period of con- 
tinued approach to both B+ and W— 
would be expected in the discrimina- 
tion phase. 


Discrimination Reversal as a “Pre- 
discrimination” Phenomenon 


The evidence on discrimination re- 
versal is certainly not unequivocal ; 
however, the weight of it suggests a 
U shaped relationship between ease of 
discrimination learning (B+W—) and 
number of prior B—W+ trials. This 
would look like the B+—>B+W-— 
prediction (middle panel Figure 6) 
in direction. Reversal, at least up 
to the 32-trial point, would produce 
successively slower learning; but re- 
versal after many postcriterion trials, 
e.g., the 120-trial point, would facili- 
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to both of the stimuli, and approach 
responses in the presence of both were 
always rewarded. The Ward experi- 
ment represents continuous (predis- 
crimination) reward experience with 
both discriminanda. Now let us extend 
frustrative nonreward theory to a class 
of prediscrimination treatments broader 
than that represented by the Ward 
experiment. 

Consider how discrimination learn- 
ing will be affected not only by 
continuous prediscrimination reward 
experience but also by partial pre- 
discrimination reward experience in 
relation to one or to both of a pair of 
discriminanda. I will restrict the dis- 
cussion, in this paper, to a black-white, 
successive discrimination (single pres- 
entation of stimuli). The problem is 
to make predictions about the course 
of such a discrimination on the basis 
of some history of consistent or of 
partial reward experience in relation to 
one or both of the eventual discrimi- 
nanda. Discrimination is measured 
in terms of amplitude (running time) 
changes to the positive and negative 
discriminanda, 

Table 2 lists several possible kinds 
of exposure to discriminanda prior to 
à black-white discrimination. As we 
examine these cases, it is clear that 
theorists of learning have been inter- 


TABLE 2 


KINDS OF PREDISCRIMINATION EXPERIENCE 
IN RELATION TO A BLACK (+) 
WHITE (—) DISCRIMINATION 


One Prediscrimination Stimulus 


B+ Continuous Reward, Positive Stimulus 

W-4- Continuous Reward, Negative 
Stimulus ' 

B& Partial Reward, Positive Stimulus 

W Partial Reward, Negative Stimulus 


Two prediscrimination stimuli 


B-4- W+ Continuous Reward, Both Stimuli - 


B+ W Partial Reward, Both Stimuli 
B— W+ Discrimination Reversal $ 
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ested mainly in prediscrimination ex- 
perience involving both discriminanda. 
An early example is the continuity 


versus — noncontinuity. experiments 
which varied amount of presolution 
experience with the discrimination 


problem itself (B--W—). or with its 
reverse (B—W4-), and with receptor 
orienting activity in relation to the 
discriminanda (e.g,  Ehrenfreund, 
1948; Gatling, 1951; Krechevsky, 
1932, 1938; Lashley, 1942; Spence, 
1940, 1945). More recent work on 
prediscrimination treatments as they 
influence later discrimination learning 
has taken several forms including 
Harlow's (e.g., 1949, 1950, 1957) ex- 
periments on learning sets, Lawrence 5 
(1949, 1950, 1952) studies on acquired 
distinctiveness of cues, and the work 
relating overlearning to discrimination 
reversal and transfer (eg, Birch, 
Ison, & Sperling, 1960; Capaldi & 
Stevenson, 1957; Pubols, 1956; Reid, 
1953). These newer interests in pre- 
discrimination experience have been 
somewhat less oriented than the earlier 
ones toward theoretical argument. 10 
fact, they have to some extent tende 
to bring.together the earlier wide split 
between S-R and cognitive approaches 
to discrimination, while investigating 
mediational processes affecting dis- 
crimination. Perhaps a point has bee? 
reached where it may be profitable uo 
begin looking at the prediscrimination- 
discrimination sequence in a way 
which will bring the cognitive-expect- 
ancy position even more into the fo 
of an S-R (conditioning-expectancy 
interpretation. It seems to me that 4 
treatment of discrimination phenomem 
in terms of anticipatory (conditione 
mechanisms does jüst this. ” 
Consider the following “attributes | 
of prediscrimination exposure ‘tos i 
Single stimulus: (a) The prediscrim 


5 5 = he 
nation stimulus may become either t 


g pe : : H he 
positive or.the negative stimulus in t 


iv aces e- 
eventual discrimination. (b) The T 
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sponse to this prediscrimination stimu- 
lus may be partially or continuously 
rewarded. (c) The number of predis- 
crimination exposures to the stimulus- 
response-reward (or nonreward) se- 
quence may be large, small, or of 
Some intermediate value. Figure 4 


`4The values for number of trials, 10, 32, 
and 120, which appear in Figures 4, 5, and 6, 
and in the text in various places, are speci- 
fied here for purposes of exposition and 
because we are now conducting experiments 
which involve these numbers of prediscrimi- 
nation trials. However, these numbers of 
trials only represent guesses, at the present 
time, as to when the various stages of prac- 
tice are reached. Rather than be pinned 
down to these numbers of trials in the pre- 
dictions for discrimination learning which 
follow, I would prefer to set up independent 
indices of the existence of one stage or 
another and to predict discrimination out- 
Come on the basis of these indices rather 
than some number of trials. The index for 
the first stage is relatively safe and simple. 
It is a stage at which nonreward produces 
no evidence of frustration as measured by 
the FE in a double runway, or in any other 
way. If there is no frustration, there can 
be no rr, The presence of rr takes us into 
the second stage. The choice of a number 
of trials to designate the second stage is 
most difficult since it must represent a very 
Critical phase in partial reinforcement: 
enough trials for rr to be present but not 
So many that sr is associated with approach. 
The independent indicator of the middle 
Stage (rr, but sr not associated with ap- 
Proach) is increased variability of behavior 
Soon aíter nonreward produces the FE 
(Amsel, 1958). Stopping partial reward 
acquisition at such a point defines this middle 
stage. The late stage (identified here, 
conservatively, as 120 trials) has a clear 
independent definition. It exists (a) when 
variability decreases again, but more particu- 
larly (b) when partial reward performance 
exceeds continuous reward performance in 
vigor (see Goodrich, 1959; Haggard, 1959; 
Spence, 1960; Wagner, 1961a). While it 
might appear from earlier partial reinforce- 
ment experiments that 32 trials of partial 
reinforcement are sufficient to get us to the 
last stage and produce the PRE (and there- 
fore are too many trials for our intermediate 
condition) experiments demonstrating PRE 
With this many acquisition trails (e.g, Shef- 


field, 1949; Weinstock, 1954) have usually 
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POSITIVE 
IN. DISCRIMINATION 
PREDISCRIMNATION 
STIMULUS 


NEGATIVE. 
IN. DISCRIMINATION 


CONTINUOUS 
PREDISCRIMINATION REWARD CONDITION 


PARTIAL 


Fic. 4. Schema showing 24 prediscrimi- 
nation-discrimination sequences (12 condi- 
tions, each counterbalanced) resulting from 
three attributes of prediscrimination ex- 
posure to a single discriminandum. 


shows the various possible combina- 
tions of a, b, and c for a black-white 
discrimination. The 12 cells of this 
schema represent a matrix of possible 
experimental conditions, each counter- 
balanced for absolute effects of color. 

What can be predicted from a be- 
havior theory with frustrative non- 
reward premises about the effects of 
prediscrimination experiences in rela- 
tion to a single discriminandum on 
subsequent black-white discrimination ? 
Is it possible to show that all combina- 
tions of a, b, and c and also effects of 
prediscrimination experience involving 
both discriminanda will be deducible 
from a relatively small number of prin- 
ciples? In the next four sections, I 
will look at four kinds of prediscrimi- 
nation treatment and, for each one, try 
to see what the theory leads us to 


expect. 


Partial Reward of a Response to One 
Discriminandum Prior to Discrimina- 
tion 

Our guiding conception, when the 
prediscrimination treatment involves 


involved an initial period of about 10 con- 
tinuous rewards. Our procedure involves 
partial reward from the outset. 
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partial reinforcement and a sufficiently 
large number of trials, is that antici- 
patory frustration produced stimuli 
(sp) become associated with approach 
responses. (Miller, 1960, has sug- 
gested a similar mechanism— "learning 
to advance to the cue of fear"—to ac- 
count for the learning of resistance to 
fear.) The reasoning is that while sp 
will at first elicit responses antagonistic 
to approach, it will finally become con- 
nected to approach tendencies. This 
means that at some stage in partial 
reinforcement sp will become part of 
a stimulus complex evoking approach, 
and earlier evidence of conflict will 
disappear. 

Of the four Stages of practice in 
partial reinforcement previously out- 
lined, three are theoretically differenti- 
able with respect to the involvement of 
Tp, and therefore sp: a stage where rp 
has been conditioned, but not yet rp 
(10 trials) ; a stage where both rg and 
rp have been conditioned and their 
stimuli, sp and sp, still evoke competing 
(approach and avoidance) response 
tendencies (32 trials); and a stage 
where both rp and rp are conditioned 
and the stimuli they produce, Sr and 
sp, both evoke approach tendencies 
(120 trials). This conceptualization 
of three stages in partial reinforcement 
in regard to rp—sp, along with an 
assumption borrowed from Miller's 
analysis of conflict and displacement 
(1944, 1948), will provide the basis 
for some deductions which I find 
i sting. "e 
The e — from Miller's analy- 
sis of conflict, for which there is good 
experimental support, is that the gen- 
eralization gradient for positive (ap- 
proach) tendencies. is flatter ne that 
for negative (avoidance) ten areas 
If we now identify sr App eh 
results from rg) and sg—AÀv (which 
results from rp) as such p Ta 
negative tendencies, the webs e 

of Figure 5, from top to 
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STRENGTH OF RESPONSE TENDENCY AT 
BEGINNING OF DISCRIMINATION FOLLOWING 
PREDISCRIMINATION PARTIAL REWARD 

TO BLACK (B?) 


EARLY STAGE 
(10 TRIALS) 


MIDDLE STAGE 
(32 TRIALS) 


KS cc Neon, 
(gcc MN 


esia seriada 
or cuc lE s 


M (120 TRIALS) 
TS c, 


~ 


RELATIVE STRENGTH OF RESPONSE TENDENCIES 


BLACK WHITE 


DISCRIMINATION STIMULUS 
Fic. 5. An application of an assumption 
from Miller's analysis of conflict to predic- 
tions of ease of discrimination learning fol- 
lowing various prediscrimination exposures 
to a single discriminandum. 


spectively, represent the state of af- 
fairs at the beginning of a discrimina- 
tion, when the discrimination has been 
preceded by a few, an intermediate 
number, or a large number of partial 
reinforcements of a response to the 
prediscrimination stimulus (B). Rep- 
resented on the left-hand side of the 
graph are the relative strengths of ap- 
proach and avoidance tendencies tO 
the prediscrimination stimulus (B) 
through the mediating response pro- 
duced stimuli, Sr and sp. Shown 08 
the right are the generalized strengths 
of these tendencies to the stimulus 
(W) which will be new in the dis- 
crimination, (If the prediscriminatio? 
Stimulus were W, then B would be the 
new stimulus in the discrimination a” 
the B and W positions on the baseline 
would of course be reversed.) J 
A simplifying assumption, repre 
sented in the bottom panel of Figure n 
is that the strength of approach, elicite d 
through rg—sy by B and W in the lat 
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Stage, is related to the strength of 
avoidance tendencies elicited through 
Tr—sp in the middle stage. I am sug- 
Sesting that the avoidance tendency to 
Sr is “neutralized” and has the effect 
of an approach tendency at this stage. 

Some predictions can now be made 
relative to the conditions in Figure 4, 
and these will be restricted to the 
underlined conditions. (The other 
Conditions simply counterbalance for 
ny absolute effects of color and I as- 
Sume they will not be different in 
direction.) Again, let me emphasize 
that the predictions will differ depend- 
ing on the depth dimension: Was 
Prediscrimination training discontinued 
at the stage of development of rp but 
Not rp, indicated by the point of earliest 
Appearance of FE? Or, was training 
Carried on to the intermediate conflict 
Stage at which sp evokes avoidance? 

T, were a very large number of trials 
Tun, allowing sp to become connected 
to approach? Outlined below, for each 
Of these cases, are predictions as to 
Which discrimination will be learned 
faster : that in which the prediscrimi- 
nation stimulus remains positive in the 
discrimination  (B&-—B-FW-—), or 
that in which it becomes negative 
(B5 B—W-4-). 

Let us begin with the case of dis- 
crimination following 120 partial re- 
Wards, The reasoning (see Figure 

dis At the start of discrimination, B 
Will elicit rz and rp strongly and about 
equally, On the other hand, W, which 
1S new in the discrimination, will elicit 
Tn relatively strongly because of the 

at positive gradient, but will elicit rp 
Very weakly because of the steep nega- 
live gradient. This means that when 

is made negative in the discrimini- 
Nation after a great many prediscrimi- 
Nation trials, B— will elicit strong 
Tr— sp, and sp will elicit continued ap- 
Proach, This will slow discrimination 
relative to the condition where B is 
made positive in the discrimination. 


Here the W— stimulus which is new 
in the discrimination elicits very weak 
rg. Consequently, there is little sp 
to elicit approach, little to counteract 
the build-up of avoidance to W—, and 
discrimination should develop more 
quickly. The theory predicts the fol- 
lowing: After a very large number of 
prediscrimination partial rewards of 
an approach response to B, B--W— 
will produce faster discrimination than 
will B—W-+-. This prediction and its 
relationship to other predictions are 
shown in Figure 6. 

The most interesting and critical 
comparison to be made for purposes 
of our analysis is between the 120- 
prediscrimination-trial case and the 
intermediate, 32-prediscrimination-trial 
case. In the 120-trial case sp is al- 
ready connected to approach, while in 
the 32-trial case sp still elicits avoid- 
ance. Consequently the predictions in 
these two cases are reversed. The 


PREDICTED 
RELATIVE RATE OF DISCRIMINATION LEARNING 


Bw 2BHW- 
Es 


ee 
B*W* 2B*W- 


10 32 120 


NUMBER OF PRE-DISCRIMINATION TRIALS 


Fic. 6. Predicted rate of discrimination 
learning following various prediscrimination 
exposures to one or both of the eventual 
discriminanda. 
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tate the formation of the reversed 
discrimination. i 

The discrimination reversal studies 
cited earlier have been conducted in 
situations involving joint presentation 
of stimuli. They involve two basic 
findings. The first concerns ease of 
discrimination following the first re- 
versal; the second concerns the “learn- 
ing to learn” phenomenon—that suc- 
cessive reversals shorten the numbers 
of trials required to criterion. For 
the first of these findings, there is scant 
evidence in situations involving single 
presentation of stimuli (the present 
case) ; however the recent Birch, Ison, 
and Sperling study (1960) is impor- 
tant and relevant. 

The conditions of the Birch, Ison, 
and Sperling experiment are closest to 
meeting the boundary conditions of a 
frustrative nonreward analysis of re- 
versal as a “prediscrimination” phe- 
nomenon. Their subjects, rats, were 
given 11 trials each day, the first al- 
ways positive, the next 10 divided be- 
tween positive and negative according 
to random orders. Their measure was 
response speed rather than choice, and 
their apparatus was essentially a run- 
way. They were trained initially to 
approach a white platform and to avoid 
a black one and, when they reached a 
criterion of discrimination defined in 
terms of approach speed, half of the 
subjects were switched to a B+W— 
discrimination immediately, the other 
half after 12 additional days of over- 
training to B—W+. The average 
number of trials to reach the original 
criterion for both groups was about 50 
(well beyond our 32-trial point). This 
means that the overtrained group had 
about 180 trials to B-W+ (well 
beyond our 120-trial point) before it 
was reversed and learned B+W-. 
'The overtrained group learned. the re- 
versed discrimination in significantly 


fewer trials. 
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Interpreting their results, Birch, 
et al. (1960) state: ". . . the differ- 
ence between the two groups appears 
to be located in their differential rates 
of extinction to the previously rein- 
forced cue. No difference in rate of 
acquisition to the newly positive cue 
was found in the present data" (p. 39). 
These results have a rather obvious 
interpretation in terms of a frustrative 
nonreward position; but the situation 
is complex and there are theoretical 
difficulties to overcome. The obvious 
part of the interpretation is that the 
overlearned group has developed 
stronger rg to W+, which makes 
subsequent W— more frustrating. 
which facilitates the conditioning of W 
to rp, which, finally, ensures the evoca- 
tion of avoidance responses by Sr 
This part is fine for the theory; how- 
ever, there is the matter of what hap- 
pens to sp, which presumably develops 
in the initial discrimination of the 
Birch, et al. experiment to the negative 
stimulus (B—) and which is connected 
to avoidance. Shouldn't the over- 
trained group have a stronger sr 
avoidance tendency in relation to 
than the criterion group? And 
shouldn't this retard the learning of 
the reversed discrimination in the 
overtrained group when B subse. 
quently becomes the positive stimulus: 
This difficulty remains for the theory 
unless one assumes that rg (and sr? 
approach) builds in strength very 
slowly over many trials (at least over 
50) while rp (and sy—avoidance) de- 
velops quickly and is maximal by about 
50 trials—this being the case despite 
the fact that the growth of rp depends 
upon the prior growth of rg. This 15 
certainly an interesting possibility. 
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theory, and presents support for the 
position that an "active" role must be 
assigned to nonreward in both motiva- 
tion and inhibition. This evidence is 
not only from experiments specifically 
designed to study frustrative proper- 
ties of nonreward, but also from a 
variety of other kinds of experiments, 
including, among others, studies of 
extinction as a function of magnitude 
of reward and number of rewards, and 
experiments in discrimination learning 
which demonstrate the relative impor- 
tance of nonrewarded and rewarded 
trials, 

_ The importance of an active concep- 
tion of nonreward in current thinking 
1S discussed with reference to simi- 
larities and differences between recent 
Cognitive-expectancy and neo-Hullian 
Interpretations of the partial reinforce- 
ment effect. The suggestion is that 
Current S-R conditioning interpreta- 
tions of these partial reinforcement 
phenomena, stressing the role of both 
anticipatory reward and anticipatory 
frustration mechanisms, and involving 
à more specific and better worked out 
language, provide a more workable 
“expectancy” theory than do cognitive 
Interpretations. 

The contention is that current neo- 
Hullian approaches, employing condi- 
tioning-expectancy concepts, go beyond 
Cognitive-expectancy approaches in 
both specificity and predictive power. 

S evidence for the merit of such a 
Position, data are presented from dis- 
crimination learning experiments which 
test predictions from a conditioning 
analysis of nonreward effects, and an 
extension of frustrative  nonreward 
theory is presented which leads to pre- 
dictions about the course of discrimi- 
Nation learning following various pre- 
discrimination experiences. These 
Predictions depend on the interaction 
of three attributes of prediscrimination 
experience: (a) whether prediscrimi- 
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nation exposure is to the stimulus 
which is eventually positive or negative 
in the discrimination (or to both) ; (b) 
whether the prediscrimination treat- 
ment involves continuous or partial 
reward; and (c) the number of 
prediscrimination exposures (trials), 
representing different stages of de- 
velopment of anticipatory reward and 
anticipatory frustration. 
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In a cue learning problem (dis- 
crimination learning, concept forma- 
tion, and maze learning) the subject 
chooses one of two or more responses 
On each trial. The correctness of the 
Tesponse depends on some aspect of 
the situation at the time of response. 

In this paper it is assumed that 
Subjects have difficulty with cue 
learning problems to the degree that 
they tend to use strategies (habits, 
patterns of response) which conflict 
With the strategy intended by the 
experimenter. It is often assumed 
that the subject must also associate 
Correct responses with cues in the 
Situation. However, in most cue 
learning experiments the subject is 
instructed or pretrained to make the 
desired responses before the cue 
learning process begins. Further- 
more, the experimental situation con- 
Strains the subject to make exactly 
one of the available responses per 
trial. It seems reasonable to assume 
that cue learning is not so much a 
matter of the formation as of the 
Selection of responses. The present 
theoretical discussion begins with a 
Set of strategies and is concerned with 
the mechanisms by which the subject 
might select out those strategies in- 
tended, and consistently rewarded, 
by the experimenter. 

. The term "strategy" is employed 
In a sense related to the more common 
terms, "habit" and "hypothesis," to 
designate a particular pattern of re- 


1 This research was facilitated by the 
author's tenure as Faculty Research Fellow 
Of the Social Science Research Council, 
1959-61. 
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sponses to stimuli. Consider, for 
example, a rat being trained to jump 
to the larger of two white circles in a 
Lashley Jumping Stand. Some pos- 
sible strategies include; jumping to 
the left side, jumping to the right 
side, jumping alternately left and 
right, jumping always to the side last 
reinforced, jumping to the side which 
last was punished, jumping in the 
direction the rat was placed down on 
the stand, jumping toward the circle 
with less (or more) acute curvature, 
jumping toward the side with more 
white area, jumping toward the side 
which reflects more total light, and so 
forth. Each such pattern of behav- 
ior, if it occurs in isolation for a suffi- 
ciently long sequence of trials, can be 
identified by the experimenter. It 
will be seen that there are ordinarily 
a large number and variety of possible 
strategies in a cue learning problem, 
and that a number of strategies may 
be perfectly confounded. In the ex- 
ample above, jumping to the circle 
with less acute curvature will always 
lead to the same result as jumping to 
the side which reflects more light, 
since these will both be jumps to the 
larger circle. 


THREE MODELS OF THE 
SELECTION OF STRATEGIES 


The general idea developed here is 
that the problem gives rise to a set of 
strategies. The subject uses these 
various strategies, which at first are 
selected at random, and attempts to 
select strategies which will consist- 
ently lead to correct responses. For 
simplicity the discussion is limited to 
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problems in which there are at least 
some strategies which are always 


. correct. 


| 


Let H be the set of strategies avail- 
able to the subject at the beginning 
of the problem. Suppose that some 
subset C of these strategies are always 
correct, another subset W are always 
wrong, and the remainder, J, are 
sometimes correct and sometimes 
wrong. For simplicity of presenta- 
tion, begin with a problem in which 
the strategies of I are correct and 
wrong, on successive trials, with in- 
dependent probabilities all of which 
are $, and restrict attention to the 
two-choice problem. 

Presentation of the theory of this 
conventional discrimination learning 
problem will begin with a special case 
in which one strategy is used on each 
trial Then an alternative theory 
will be proposed, in which the subject 
uses all strategies at once and attempts 
to narrow down to the correct one. 
Third, a model will be proposed in 
which the subject chooses a random 
sample of all strategies and attempts 
to narrow down to the correct strate- 
gies within his sample. These three 
models will be shown to be essentially 
equivalent, in a theorem which may 
be called the “indifference to sample 
size" theorem. Following proof of 
that theorem, some empirical impli- 
cations of the theory will be derived 
and compared with relevant data. 


One Strategy at a Time 


On the first trial of the experiment, 
the subject selects a certain single 
strategy from H at random and makes 
the indicated response. If the re- 
sponse is correct, the same strategy 
is used on Trial 2. : If it is again cor- 
rect, it is used again on Trial 3, and 
soforth. If, on any trial, the d 
is incorrect, the subject returns his 
strategy to the set H and then chooses 
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a strategy again at random. It is 
considered possible that the same 
strategy may be resampled, so that 
the sampling process is with replace- 
ment. Imagine that an error occurs 
on Trial 2. Then a new strategy is 
chosen and this strategy is used on 
Trial 3. If it is correct it is used 
again on Trial 4. If the response on 
Trial 3 is wrong, the subject again 
chooses an hypothesis at random for 
Trial 4. 

Notice that if the subject chooses a 
correct strategy, he will make no 
more errors. The correct strategy 
leads to correct responses, and when 
the response is correct the subject 
does not change strategy. Thus, the 
learning process is terminated by the 
choice of a correct strategy. 

Since sampling is with replacement, 
the probabilities of choosing a correct, 
à wrong, or an irrelevant strategy are 
constant from trial to trial. Let these 
three probabilities be c, w, and 2 = 
1—c-— wu. 

At the beginning of training the 
subject chooses a strategy of one of 
the three types, which have probabili- 
ties c, w, and 4. After each error he 
also chooses a strategy, with prob- 
abilities c, w, and ¿ Thus an error 1$ 
an event which returns the subject to 
exactly the same condition (in terms 
of the probabilities of future events) 
with which he started—it resets the 
process and makes it begin over. 
Such an event is called a “recurrent 
event" (Feller, 1950). Since our 
experimental interest centers around 
errors and correct responses, we can 
limit attention to errors and thus to 
the theory of recurrent events. 

Imagine that at some Trial n the 
Subject makes an error. There is à 
certain probability f; that the next 
error occurs at Trial n + 1, a prob- 
ability f? that the next error occurs at 
Trial 2 + 2, and in general a prob- 
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ability distribution f; that the next 
error after Trial z occurs at Trial 
n-+j. In a model of recurrent 
events like the present one, the dis- 
tribution f}; is the same for all Trials z 
on which an error occurs, and is the 
Same as the distribution of probabili- 
ties that the first error occurs on 
Trial 1, 2, etc. 

The random process and all of its 
properties are specified by the dis- 
tribution fj. However, the analysis 
of the process from the distribution f; 
Will be postponed until two steps have 
been completed; first, calculation of 
the distribution f; for the one-strategy- 
at-a-time model, and second, the 
formulation of two other models, very 
different in assumptions, which also 
lead to systems of recurrent events 
with the same distribution. When 
these steps are completed, the general- 
ity of the f distribution will be appar- 
ent, and derivations of properties of 
the process will be justified. 

First, fı is the probability of an 
error immediately following another 
error. 'The second error can occur 
because a wrong strategy was chosen 
(with probability w) or because an 
irrelevant strategy was chosen and 
turned out to be wrong (with prob- 
ability 27). These are mutually ex- 
clusive events and exhaust the possi- 
bilities, so that: 


fiw. 


An error ‘occuring on the second trial 
after the last one, with a correct re- 
sponse intervening, cannot result from 
the selection of a wrong strategy for 
the wrong strategy would not lead to 
an intervening correct response. If 
the error does occur on the second trial 
we know that the strategy chosen was 
not correct. Hence it must be an 
irreleyant one which led first to a 
Correct and then to a wrong response. 
In simple discrimination learning, the 
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probability that an irrelevant strategy 
would lead to a correct and then a 
wrong response is 3-4 = (3). The 
probability of choosing an irrelevant 
strategy which gives this sequence, 


correct-wrong, is therefore: 
= ete 
f2 = 7G) 
By the same reasoning, any value of 
f» j > 2, implies that an irrelevant 


strategy was chosen and then led to 
j — 1 correct responses followed ,by a 


-wrong response. The probability of 


a string of exactly j — 1 correct re- 
sponses each with probability 2, fol- 
lowed by an error with probability 
3, is (3) Hence, for j= 2: 


f; - iQ 

Summarizing the results we have the ; 
distribution : 

h2vwt& 

f; - iQ 
for all j> 2. The distribution f; is 
not a proper probability function 
because it does not sum to unity. 
Notice that: 


E E 
È )-wkiY Qy-wri-i-e 
= = 

In Feller's terms this means that 
errors are uncertain recurrent events, 
for with probability c the subject 
chooses a correct strategy and never 
makes another error. Some of the 
probability of the f distribution is 
located at the (improper) point ce, 
and the proportion so located is c. 
This merely reflects the fact that 
learning occurs in the present model 
and the subject can eliminate errors. 
The fact that f; is not a probability 
function does not place any serious 
difficulties in the way of analysis. 


AIL Strategies at Once 


We now consider a second model 
of a subject working on the same dis- 
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crimination learning problem dis- 
cussed above, with exactly the same 
set of strategies H divided into correct 
(C), wrong (W), and irrelevant (7) 
subsets. Sincea strategy corresponds 
to any specific cue in the situation 
and these cues are numerous and 
finely divisible, consider that the set 
of strategies consists of a fairly size- 
able (though finite) number of strate- 
gies, each of which is as likely to be 
used, and has as much influence on 
behavior, as every other. 

In the all-strategies-at-once model 
we imagine that the subject begins 
the learning process by considering 
the entire set M of strategies simul- 
taneously. In informal terms, he 
then attempts to narrow down to the 
correct strategy by a process of elim- 
ination. However, we suppose that 
the subject has only a limited memory 
and cannot remember all of the strate- 
gies at once, but only those with which 
he is presently concerned. The main 
disability of the subject in this model 
is that he may accidentally eliminate 
the correct strategies. When this 
happens he is unable to remember 
what they were, for he can remember 
only the ones he is still using. When 
the correct strategies have been lost 
the subject has no recourse but to 
begin over with the whole set of 
strategies. This general idea is now 
stated more precisely. 

On the first trial the subject con- 
siders all strategies and chooses his 
response. The correct strategies and 
some of the irrelevant ones lead to- 
ward a correct response, the wrong 
strategies and the remainder of UR 
irrelevant ones lead to an error. We 
suppose that the probability of a es) 

sponse is equal to the po e 
the strategies which lead to = 
response, SO that, if we let Na Vu, 
and Ni be the numbers of strategies 


in sets C W, and J, and if we let 
1 a 


Nz be the number of irrelevant 
strategies which lead to an error on 
the next trial, the probability of an 
error is: 


fi = (No + N-)/(Nwo + Ne + Nj. 


When the subject makes this first 
response we suppose that he elimi- 
nates or sets aside all the strategies he 
does not use. Suppose, for example, 
that he chooses a correct response. 
He at the same time eliminates all 
the wrong strategies and the irrelevant 
strategies which would have led him 
to the opposite response. On the next 
trial he will have only the correct 
strategies and the surviving irrelevant 
strategies left. 

To complete the discussion and 
generalize it somewhat we introduce 
notation for the set of irrelevant 
strategies which, following any Trial 
n, first lead to a wrong response on 
Trial n +j. Let N,,; be the number 
of irrelevant strategies which lead to 
correct responses on trials n + 1, 
n+2,...,2+ j — lj, andan error 
on trial n +j. Now we can write 
the probability of an error after an 
error as: 


fi = (Nw + Nayn)/(Ne + Nw + Ni): 


After one correct response, the only 
strategies which will lead to an error 
are those N, irrelevent strategies 
which first lead to an error on the 
second trial. The total strategies 
left are Ne correct and N; — Ns 
irrelevant strategies which have not 
already led toward a wrong response 
Thus, the probability of an error fol- 
lowing one correct response which 
follows a given error is: 


f (4 —f) QV. s/ EN. H- IN ;— Na a) 
Ne+Ni— Nar No 
Ne+No +N: NG-N;—Nsa 


Extension of this line of argument 
shows that in general : 


SELECTION OF STRATEGIES IN CUE LEARNING 333 


O Na+ Ni — Naa Net Ni — Naa — Nae 


f 5 
^o Ne + Nw + Ni Ne + Ni — Naa 
Ne H N; Naa Nnaji Naga Nn.i 
NaF Ni— Naa ooe Nn j-3— Nae Net Ni— Naa .—N.a- Nj 


In this expression the numerator of 
each term except the last cancels with 
the denominator of the following 
term, so that: 


Ji = Nn, i/ (Ne + No + Nj 


In our ideal experiment, Nn the 
number of strategies which first lead 
to a wrong response on Trial n + J 
after a given error on Trial z, is in- 
dependent of n and is given by IN;(2)7. 
This statement follows from the idea 
that irrelevant strategies are correct 
and wrong with independent prob- 
abilities of 4}. Actually, N;(3)? is the 
expected number of irrelevant strate- 
gles which first lead to an error on 
Trial n +j, but the derivation to 
follow does not hinge on the distinc- 
tion. 

If we substitute N;(3)/ for Na; in 
the expression for f; and simplify, 
letting 4 be the proportion of irrele- 
vant strategies in M, we obtain: 


f; = iQ. 
A similar substitution in the expression 
Or f; gives: 
hw TÉ. 
These two results are in agreement 
with the one-strategy-at-a-time 
Model. We have shown that the 
One-strategy-at-a-time and the all- 
Strategies-at-once models are both 
Systems in which errors are recurrent 
events, and we have shown that f; is 
€ same in both systems. Hence, so 
ar as errors are concerned, the two 
models are identical. 
One remark is in order. In the one- 
Strategy-at-a-time model, the proba- 
ility of a correct response on any 


trial is 1, 3, or 0 depending on whether 
acorrect, irrelevant, or wrong strategy 
is being used. The probability of a 
correct response in the all-strategies- 
at-once model can take a variety 
of values; if there are a great many 
strategies, the possible values of this 
probability are numerous and closely 
packed. The two models are not 
at all alike in the probabilities in- 
volved, and the theorem that they 
give indistinguishable sequences of 
errors is by no means obvious on 
intuitive grounds, even though the 
proof is easy. 

The correspondence of the models 
extends to the interpretation of the 
parameters. In the one-strategy-at- 
a-time model the parameters c, W, 
and i are the probabilities of selecting 
a correct, wrong, or irrelevant strat- 
egy. In the all-strategies-at-once 
model the parameters c, w, and iare 
the proportions of correct, wrong, and 
irrelevant strategies, where elemen- 
tary strategies are taken to have equal 
weight. In terms of experimental 
realizations of the models, these two 
sets of parameters are not distinguish- 
able. 

It is still possible, of course, to tell 
which model is more reasonable by 
any technique which goes beyond the 
choice data. One might, for example, 
ask human subjects which strategies 
they are using, or one might use 
memory tests to see what parts of the 
situation have been employed. The 
equivalence of the models is asserted 
only for the acquisition of a single 
problem, and only with respect to 
the overt choices made: the writer will 
suggest, in subsequent work, that rec- 
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ognition and transfer tests may per- 
mit differential predictions. 


A Random Sample of Strategies 


The two models discussed above 
are extremes. One would intuitively 
imagine that the subject would use, 
not all strategies and not just one, 
but some sample of strategies. The 
third model, called the random-sam- 
ple-of-strategies model, supposes that 
on the first trial and after each error 
the subject draws a random sample 
of all strategies. So long as he makes 
no errors the subject discards wrong 
and irrelevant strategies by the same 
mechanism as that invoked in the all- 
strategies-at-once model. If the sub- 
ject makes an error he takes a new 
independent sample from H and be- 
gins the process over. 

'The theorem to be proved is, if the 
sampling process is random in a strict 
sense to be described below and ex- 
cludes the possibility that the sub- 
ject might choose the empiy sample, 
then the random-sample-of-strategics 
model generates a system of recurrent 
events which is the same as the other 
two models given above. 

We again consider the set H of 
strategies, composed of equally potent 
elementary strategies. Suppose that 
there are N strategies in H of which 
AN, are correct, Nw are wrong, and 
N; are irrelevant. We shall interpret 
N./N as c, the proportion of correct 

f ed and N;/N as ?. 
strategies, Nwu/N as w, a 
ling model of 
In a random sampling 
: sign a fixed prob- 
learning one may assig du cadit eles 
ability of being sample : 
le size vary 
ment and let the samp 
5 “on a fixed sample- 
randomly, or assign 
; i hat all such samples 
size, supposing that 2 Estes, 1959.) 
are equally probable (Estes, ar 
aan he possibility of the 
To eliminate the p 
:t is desirable to deal 
empty sample it is t TI 
with a fixed sample size 7. 1 he 
random-sampling specification is that 


FRANK REsTLE 


all samples of size 7 are vequa ly prob- 
able. l r 

The total number of samples of size 
n which can be drawn from a popula- 


tion of N elements is: 
A 


(apa 
n] n\(N — n)! 


ñ 
docs j c 
Similarly, one can draw* ( is ) sam- 
i c 


ples of 7 correct strategies from E. 
i * og i R ach 
population of Miés and 8!€s- For ea 


of these sampla would ?rrectstrateg ies, 


there are ( ~TESPONS4ples of tw Wrong 


lacs Ot 
; $ NA No 
strategies, so that tl ere are 
Ne Nw 
ways of drawing n, correct and w 
wrong strategies. / For each of these 


UNE Ni 
combinations there are ( t ) ways 
Ni 


of drawing 7%; irrelevant strategies: 
Hence the total number of ways one 
could obtain 1; correct, nw wrong, aP 
n; irrelevant strategies is: 


COE 

( We JN tw IN Ni 

where, of course, it is assumed that 
Ne + Nw +n; =n. Since every sam- 
ple of size n is assumed to be as likely 
as any other, the probability of draw- 


ing a sample of exactly 7 correct; 
nw wrong, and n; irrelevant strate- 


gies is: 
E Nw X 58) 
Ne] N Nw ni 


©) 


which is the three-category gener: 
tion of the hypergeometric dist 
tion (Feller, 1950). f 
It was asserted, in the statement e 
the model, that once the subject d 
chosen a sample his performant” 
follows the rules of the all-strate£ic? 
at-once model. Hence, after a” 


Pn Nw, i) = 


aliza- 
ribu- 


| 
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error and the choice of a sample with 
Ney Nw, and n; strategies, we can con- 
Struct the f; distribution. We may 
write the f distribution conditional on 
the constitution of the sample as: 


fG|sample with ne, mo, mi strategies) 

= | (tw + 22/2 

GYVn:/n 
pcectly from our results on the all- 
Ri cee at once model. To obtain 
ET unconditional f; we multiply each 
Eenditional f by the probability of the 
ps and sum. This gives us the 
enon of the conditional f. 
or fi we have: 


fi = EQ 4322/1] 
= E(n,/n) + 3E(n;/n) 


Bowever, in the hypergeometric dis- 
Morton the mean proportion of 
i ong strategies in samples of size 7 
S just the proportion of wrong strate- 
De in the whole population. That 
i E(n,/n) = N/N = w. Similar 
Bes cticns hold for irrelevant and cor- 
ct strategies. Hence: 
fir=wt+ it 
and, by the same argument: 
f= d 

fed greater than 1. Thus it is shown 
Er the random-sample-of-strategies 
d odel with fixed sample size ? Is a 
System of recurrent events (this point 
1S obvious and is not proved above) 
With the same f distribution found in 
the ODe-strategy-at-a-time and the all- 
Strategies-at-once models. At this 
Point one can see that the one-strat- 
S8y-at-a-time model is a special case 


x) the random-sample-of-strategies 
Nodel with sample size ? = 1; and 


for j = 1 
forj ur 


Similarly, the all-strategies-at-once 
model is a special case with sample 
Size = y 


One further generalization is possi- 


ble. Since the final equations of the 
random-sample-of-strategies model 
contain no reference to the sample 
size n, one would get a system of re- 
current events with the given f dis- 
tribution from any mixture of systems 
which use different sample sizes. 
One can withdraw the assumption 
that the sample size is fixed, permitting 
the possibility that samples are of all 
different sizes. The only restrictions 
are that for any 2, all samples of size 
n are equally likely; and there are no 
empty samples. 

The strategy-sampling model differs 
from Estes' stimulus-sampling model 
(Estes, 1959) in an important respect. 
Let the total number of elements 
(strategies or, in Estes' case, stimulus 
elements or patterns) be V and the 
sample size be 7. In Estes' formula- 
tions, n/N is analogous to the learning 
rate 0 and controls the rate of learning 
and all the statistical characteristics 
of the acquisition data. In the pres- 
ent model, learning is independent of 
the sample size 2/N and depends 
solely on the composition of the basic 
set H; namely, the proportions of 
correct, wrong, and irrelevant strate- 
gies in H. Thus while the sampling 
process invoked in this strategy- 
sampling theory is one of those used 
by Estes, its place in the theory is 
entirely different. 

In this section it has been shown 
that three models of the selection of 
strategies all lead to the same system 
of recurrent events. Whether or not 
this is of interest depends largely on 
whether the resulting description fits 
the data of cue learning experiments, 
a question which is considered in the 


remainder of the paper. 
STATISTICAL PROPERTIES OF 
THE Raw DATA 


The discussion above has been re- 
stricted to the distribution f; the 
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probability that an error follows the 
last error by exactly j trials. The f 
distribution can be estimated directly 
from raw data, but this is neither a 
conventional nor a very interesting 
way of describing the data of cue 
learning. In this section the data 
generated by this model are analyzed 
in several of the ways commonly em- 
ployed by experimenters. Details are 
given in Restle (1961). 


Learning Curve(s) 

Consider three versions of the 
learning curve. One is the succession 
of correct and wrong responses by an 
individual subject. A second is the 
average learning curve of a group of 
subjects. The third is a corrected or 
idealized form of the learning curve 
computed by adjusting a group learn- 
ing curve (Vincentizing) or by averag- 
ing the data of subjects who are 
selected after the fact for similarity 
of over-all performance (Spence, 
1956). 

According to the present theory, 
the individual data are composed of a 
sequence of correct and wrong re- 
sponses in irregular order, followed 
by an infinite sequence of correct 
responses. If w> 0, there will be 
somewhat more errors than correct 
responses on the resolution trials, and 
there will be a tendency for errors to 
follow other errors more often than 
errors follow correct responses. The 
probability of an error following an 
error will be (w + $2)/ (w + i) where- 
as the probability of an error following 

a correct response is 3. An individual 
subject will produce such below- 
chance behavior for some block of 
trials and then abruptly, after some 
error, will either happen on a correct 
strategy (in the one-strategy-at-a- 
time model) or will begin a aon 
of elimination which ends up with all 
correct strategies. This is an extreme 
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form of the “discontinuous” of 
insightful learning curve. Unfortu- 
nately it is difficult to decide whether 
any individual subject does or does 
not exhibit this pattern, so data are 
usually combined. 

The group learning curve is merely 
the average of a set of (theoretically) 
discontinuous individual curves. 
all subjects have the same parameters 
c, w, and i, they will nevertheless 


happen to master the problem at: 


different trials, so that the average 
learning curve is gradual. Its metus 
matical form is complex but in genera 
appearance, the group learning from 
this theory resembles the common 
“growth” curve (Restle, in 1961). f 
If one selects, from a larger group 9 
subjects, those who make the same 
number of total errors, or those who 
reach criterion at the same time, anc 
averages their learning curves; Of ! 
one rescales learning trials as by the 
Vincent-curve method, the resulting 
curve will usually be S shaped. The 
writer has investigated this question 
by generating data which arise di- 
rectly from the assumptions of the 
present model, by use of tables 9 
random numbers. When, from 9 
large set of such data, one selects ? 
subgroup of "subjects" who all make 
the same number of errors, or who 
reach criterion at about the ea 
trial, and average the performance? 
within such a subgroup, the result E 
an S shaped curve. Exact Vincen 
tizing produces a flat (stationary 
learning curve before criterion. J 
The reason for the S shaped cone 
is not difficult to find; the pcr 
methods of selecting or rearrange jy 
the data tend to put the (theoretica ? 
random) times of solution close E 
gether. If the times of solution o 
put exactly together a step-functr 
should result, but if the times of 50 T 
tion are only grouped close to o 
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another, the step is blurred and an 
S shaped curve results (Spence, 1956). 


Summary Statistics of the Data 


In the present theory the actual 
trial on which learning takes place is 
not in any way fixed, but depends 
upon the random outcome of the 
process of selecting strategies. The 
result is large intrinsic variance in the 
acquisition phase, which can most 
Conveniently be described in terms 
of the total errors made by each sub- 
Ject. If each subject is trained until 
a long sequence of consecutive correct 
responses has been obtained, one is 
Teasonably sure that the number of 
errors made approximates the theoret- 
ical total errors. 

On the first trial or after any error 
the subject may either make another 
error, sometime later, or he may make 
no more errors. The probability that 
the subject will make at least one 
More error, the first one in exactly j 
trials, is fj} Thus the total probabil- 
tty of at least one more error is: 


E 
2 (f-1-—c 
rx 
With probability c the subject never 
makes another error at all. From 
this it is not difficult to show that the 
Probability of exactly k errors is 
1 — c). This is the geometric dis- 
tribution which has mean: 


E(k) = (1 — ¢)/c 
and variance: 
Var(k) = (1 — ¢)/e. 


The standard deviation of the distribu- 
tion of error scores should be nearly 
equal to its mean, according to the 
theory, and the distribution should 
Show an extreme positive skewness. 
Provided that irrelevant strategies 
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are correct just half the time at ran- 
dom, trials-to-criterion behaves very 
much like total errors. 


Methods of Estimating the Parameters 
wand c 


An important step inany mathemat- 
ical development of a learning theory 
is the estimation of parameters. 
Fortunately, in the present model 
quite simple and efficient estimates 
are available. A maximum-likelihood 
estimator of c is given by: 


6 — 1/(T +1) 


where 7 is the mean total error score 
of a group of subjects. The variance 
of this estimate, with W subjects in the 
group, Is: 


Var(é) = &(1 — ¢)/N 


It is also possible to estimate w by the 
maximum-likelihood method. As 
was mentioned earlier, a high fre- 
quency of consecutive errors in the 
presolution phase is an indication ofa 
relatively large proportion of wrong 
strategies, whereas a chance frequency 
of consecutive errors is an indication 
that there are relatively few wrong 
strategies; provided that the irrele- 
vant strategies are correct and wrong 
strictly at random. The method is 
to count “Trial 0," an imaginary trial 
before training begins, as an error. 
Then for each subject divide this ex- 
panded set of errors into Mo errors 
which are followed by correct re- 
sponses and M, errors which are fol- 
lowed by errors. Then computing 
the means of these statistics one has: 


M -M+1 
^ MTM, 


as a maximum-likelihood estimator of 
w (Restle, 1961). 
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SoME VARIATIONS OF THE 
Basic MODEL 


The theory stated above is so re- 
strictive in its assumptions that it 
cannot be applied with success to a 
great many experimental data. For 
example, with the theory above, one 
must expect the standard deviation of 
error scores to be just slightly less 
than their mean, and this is not ob- 
served with any great regularity. 
Furthermore, animal subjects often 
show a strong tendency to remain 
with one strategy even if it is wrong 
or irrelevant, for a fairly long string 
of trials. The model given above 
says that the subject starts over after 
each error, and this simply is not a 


plausible assumption for animal 
studies.? 
Perseveration 


The model is quite easily general- 
ized to take some account of persever- 
ation of strategies. In place of the 
assumption that the subject always 
resamples after an error, one supposes 
that resampling occurs with some 
probability 7, which may be consid- 
ered a constant for given experimental 
conditions and also a constant during 
the presolution period. The subject 
chooses some strategy or sample of 
strategies at Trial 1, whence the prob- 
ability of making zero errors Ir E. 
The probability that any actual error 
is the last one is only rc, the joint 
probability that (a) the subject re- 
samples and (b) having resampled, 
hits on a correct strategy (in the oug 
strategy-at-a-time model) or goes into 
a terminal process of narrowing down 
(in the all-strategies-at-once or the 
random-sample-of-strategies models.) 
For this model with perseveration the 


2 The writer is indebted to Marvin Levine 
for pointing out the importance of this factor 
in animal discrimination learning. 
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distribution of total error scores is: 


P(0) =c 
P(T) = 1 — a) (A — re)? "re. 


where T > 0. The value of this 
model in fitting animal data is illus- 
trated by analyzing a set of error 
scores reported in detail by Harlow, 
Harlow, Rueping, and Mason (1960), 

(See Restle, 1960). Harlow's distri- 
butions show a relatively high fre- 
quency of solutions with zero errors, 
along with a spread to very large 
error scores, in original learning by 
baby monkeys. The result is that 
the standard deviations of the distri- 
butions are somewhat larger than the 
means. Harlow et al. remarked on 
the high degree of perseveration shown 
by some monkeys. Even more strik- 
ing is Warren's investigation (Warren, 
1959b) of the discrimination learning 
of cats, which show strong persevera- 
tive tendencies and which give distri- 
butions of error scores with standard 
deviations much larger than the corre- 
sponding means. 

k It is not at all sure that persevera- 
tive effects can be reduced to the 
single parameter r, and the writer's 
investigations are entirely insufficient 
to support the assumption. Within 
the model there are several symptoms 
of perseveration; a high frequency of 
zero error scores relative to the re- 
mainder of the distribution, higher 
standard deviation than mean errors, 
and consecutive runs of responses 
which follow a wrong or irrelevant 
strategy, such as a position habit, ob- 
ject preferance, etc. One can imagine 
that perseverative tendencies would 
be strong whenever the incentive for 
learning is not entirely effective, or 
when the subjects (perhaps from par- 
tial reinforcement) have developed 
an expectation that rewards cannot 
always be attained. Something of 
this last effect, in college students, is 


SELECTION oF STRATEGIES IN Cur LEARNING 339 


suggested by the results of Morin 
(1955). 


Complexity of Correct Strategies 


In the model above it is tacitly 
assumed that the problem can be 
solved on the basis of a single correct 
strategy. This seems a reasonable 
assumption in the case of simultane- 
ous discrimination. However, when 
two situations are presented on suc- 
cessive trials it is a question whether 
complete solution involves learning 
one strategy (turn right to white and 
left to black) or two Strategies, one 
for each situation. If two strategies 
are learned independently and with 
the same parameters, then the dis- 
tribution of errors is the distribution 
of the sum of two random variables 
(total errors in each Situation) each 
of which has a geometric distribution. 
This results in a negative binomial 


distribution (see Restle, 1961) with 
mean: 


E(T) = 2(1 — 9c 


and variance: 
Var(T) = 2(1 — o/a. 


These statements follow at once from 
the fact that the mean of the sum of 
two independent variables is the sum 
of the means, and the variance of the 
sum is the sum of the component 
variances. With small c the standard 
deviation will be about 1/V2 or about 
0.7 of the mean. 

An interesting fact can be brought 
forward in favor of this argument, 
though the writer has not been able 
to make an exhaustive test. In a 
case of successive size discrimination 
by college students (Restle, 1955) 
the writer found ratios of standard 
deviation/mean of 1.1 and 0.9 for two 
Problems—both very close to unity, 
hence in agreement with the orig- 


inal model and with the notion that 
a single strategy is sufficient to solve 
the problem. In a comparable study 
using rats in a successive black-white 
discrimination, Amsel (1952) obtained 
data on three groups in which the 
ratio of standard deviation to mean 
was 0.65, 0.68, and 0.72. The rat re- 
sults are in excellent agreement with 
the hypothesis that the rats had to 
use two separate strategies to solve 
the problem. One might guess that 
rats do not integrate the two habits, 
right turn to white and left turn to 
black, into a single cognitive struc- 
ture, whereas humans might make 
just such an integration. The present 
analysis may be useful in throwing 
further light on the nature of suc- 
cessive discriminations for various 
species of subjects. 


ADDITIVITY OF CUES 


In several papers regarding another 
theory the writer has discussed the 
additivity of cues (Restle, 1955, 1957, 
1958, 1959a ; see also Bourne & Restle, 
1959, and Trabasso, 1960). In simple 
terms the experiment involves three 
groups; one learns a problem based 
on a set A of cues, the second learns 
a problem based on the set B of cues, 
and the third learns a problem which 
can be solved using either A or B 
cues disjunctively. If the sets .4 and 
B are separate then the third set 
4 U B should have measure m(A U B) 
= m (A) + m (B), (see Trabasso, 1960). 
Experimental results have been recon- 
ciled with an S-R theory involving 
adaptation of irrelevant cues (Restle, 
1955). 

Generally speaking, the calculations 
from the present model are in good 
numerical agreement with those from 
the "adaptation" model. Reanaly- 
Sis of three of the most satisfactory 
Sets of data used before, and one set 
not previously used, are reported 
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here. The other data previously dis- 
cussed are so fragmentary that analy- 
sis will hardly be fruitful; and Tra- 
basso (1960) used such difficult prob- 
lems that many of his subjects failed 
to learn at all on the more difficult 
problems, making analysis by the 
present model unsatisfactory. In 
general, the estimates reported below 
are not the maximum-likelihood esti- 
mates because the subjects were not 
run to a strong criterion. However, 
since learning was nearly complete, 
the approximate estimates are ade- 
quate. 

Scharlock (1955; see also Restle, 
1957) ran a place-versus-response 
experiment with rats in which the 
relative weight of correct place (extra- 
maze) and correct response (intra- 
maze) strategies can be estimated. 
He also ran one group with both place 
and response strategies correct, and a 
group with response strategies correct 
and no place cues present. In the 
calculations below it is assumed that 
the number or wrong Strategies of a 
given type (place or response) equals 
the number of correct strategies. 
This is asymmetry assumption which, 
while not strictly appropriate for 
Scharlock's experiment, is needed to 
permit prediction, since separate esti- 
mates of wrong strategies cannot be 
made on the available data. 

Rats learned to go to the same place 
(using extra-maze cues), with response 
cues irrelevant, making an average of 
9.7 errors. Using the estimate ¢ = 
1/0 + T), one estimates that the cor- 
rect place strategies make up 1/10.7 
= .093 of the total set of strategies. 
Other rats learned to make a constant 
response to different places, averag- 
ing 6.7 errors. We estimate that cor- 
rect response strategies constitute 
1/7.7 = .130 of the total set. 

The simplest interpretation of the 


experiment is that when both place 
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and response strategies would work, 
the proportion of correct strategies 
would be the sum of the proportions 
of place and response strategies, since 
the same total set of strategies is 
available. One predicts that with 
place-plus-response learning, c — .093 
+ .130 = .223. We have the formula 
that the expected errors to solution is 
(1. — e)/c. Hence the expected mean 
errors is .777/.223 = 3.4; the obtained 
mean was 4.0, which is adequately 
close. The discrepancy can partly be 
explained by the fact that the place- 
learning and response-learning groups 
were not run to a high criterion and 
probably would have made more 
errors if tested longer. The fast- 
learning Dlace-plus-response group 
would likely not have made more 
errors, since their performance was ex- 
cellent at the end of the training given. 

Another group learned a fixed re- 
Sponse in the absence of any good 
Place cues. We assume that this 
group simply had neither correct nor 
incorrect place Strategies. Its pre- 
dicted proportion of correct Strategies 
is .130/1 — 2(.093) = .130/.814 = 
-160. The expected errors is .840/ 
160 = 5.25; the observed mean was 
5.0. Both predictions are close to 
the obtained results, well within 
sampling deviations. 

Warren (19593) has reported data 
on monkeys in an experiment analo- 
gous to Scharlock's study of rats. 
Warren's monkeys had to learn posi- 
tion habits in some problems and ob- 
ject-discriminations in other prob- 
lems. Warren also used a response- 
plus-object problem (object discrim- 
ination with the objects left in the 
same place each trial) and a pure re- 
Sponse problem (e.g., choose the left 
one of two indistinguishable objects.) 
Predictions followed the same formu- 
las as above, and were extremely ac- 
curate: the proportions of object and 
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response strategies were estimated 
from behavior on object (position- 
varied) and position (object-varied) 
problems. When these values were 
added to predict object-plus-position, 
the prediction was a mean of 0.67 
errors, whereas Warren observed 0.63. 
For pure response learning, the model 
predicted 3.25 mean errors and the 
observed value was 3.04. These 
agreements between theory and data 
are well within the range of probable 
sampling error. 

Similar results were obtained by 
analyzing Warren's experiment on the 
additivity of color, form, and size 
cues (Warren, 1953). The analysis is 
substantially the same as that ‘given 
in a previous paper (Restle, 1958). 
Learning data are collected on prob- 
lems involving only color, only form, 
or only size differences between the 
objects. These data are used to cal- 
culate the proportional weights of the 
three sources of strategies, and the 
resulting values are recombined to 
compute predicted learning rates for 
problems involving two or more di- 
mensions; for example, discrimina- 
tion of a red triangle from a green 
circle involves color plus form cues. 
In each calculation account is taken 
of the greater total number of strate- 
gies involved in problems with added 
cues, and it is assumed that whenever 
a set of correct Strategies is intro- 
duced by adding cues, an equally 
large set of wrong strategies also enter 
the situation. 

Calculations on Warren's (1953) 
data afforded four predictions of total 
errors to solution. The four predic- 
tions were wrong by —17, 0, 5 and 10 
percent respectively, and all of the 
errors can reasonably be attributed 
to sampling variations. 

The hypothesis of additivity of 
cues has also been applied to human 
learning (Restle, 1959a) in an experi- 
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ment which required subjects to learn 
differential verbal responses to con- 
sonant syllables. The procedure was 
simple concept formation with indi- 
vidual letters of the syllables used as 
cues, and the same sort of additivity 
of cues as above. The adaptation 
model gave quite accurate predictions 
and the present model is, if anything, 
slightly more accurate; it predicts 
(given data on the two cues separately) 
that the added-cue group should make 
an average of 5.26 errors, whereas the 
observed mean was 5.25. Consider- 
ing the variability of the data, the 
extreme closeness is coincidental. 


COMPARISON WITH OTHER THEORIES 


The three models of the selection of 
strategies are conceptually similar to 
the writer's adaptation theory of dis- 
crimination learning (Restle, 1955). 
The "strategies" of the present model 
resemble the “cues” of the earlier 
theory. The strategy-selection model 
has, as a theorem, that the rate of 
learning depends upon the proportion 
of correct strategies. A similar idea 
is expressed in the adaptation model 
in which it was assumed that the rate 
of learning (0) would depend upon 
the proportion of relevant cues. 

Several of the serious faults of the 
adaptation theory are corrected in the 
theory of strategy-selection. First, 
in the adaptation theory the subject 
was supposed to begin conditioning 
relevant cues and adapting irrelevant 
cues, with a rate 0 equal to the pro- 
portion of relevant cues, right from 
the first trial of training. Of course, 
at that first trial the subject has no 
possible way of knowing which cues 
will turn out to be relevant—it would 
be possible for the experimenter to 
change his mind after the first trial 
is complete. Hence, the adaptation 
theory could apply only to a prescient 
subject. The selection-of-strategies 
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theory does not have the subject 
treat different strategies differently 
except on the basis of trials already 
completed, hence avoids the absurd- 
ity. Second, the idea that the struc- 
ture of the problem (proportion of 
relevant cues or correct strategies) 
controls the rate of learning was only 
a simplifying assumption, with no 
justification, in the adaptation model ; 
but is an inescapable theorem of 
strategy-selection. Third, the ad- 
aptation theory yielded a determinate 
learning curve in the sense that p(n) 
was exactly specified for a given 0 and 
p(1). The variance of typical data 
was in large part left unexplained by 
the adaptation model and had to be 
attributed to individual differences, 
even though it is notoriously difficult 
to find any strong predictor of such 
learning. In the strategy-selection 
model learning itself is a random 
event and the model generates vari- 
ability comparable with that obtained 
in the data. 

Despite these important differ- 
ences, the present theory is close 
enough to the adaptation theory to 
make it possible to carry over many 
of the theoretical insights, though in 
modified form. The one type of pre- 
diction discussed in this paper was 
additivity of cues, but other ideas 
such as the proposed basis of learning 
sets (Restle, 1958) and various quan- 
titative relationships in concept identi- 
fication (Bourne & Restle, 1959) can 
be recast in the mold of the selection 
of strategies. The, many similarities 
and few differences in predictions 
must be studied in detail and cannot 
be discussed here, except to mention 
that most of the results of the Bourne 
and Restle paper can be reproduced 
using the strategy-selection model. 

In more general terms, it may be 


ked that the strategy-selection 


r 
rema eas of Lash- 


model is similar to the id 


ley (1928) and Krechevsky (1932) in 
general intent. In comparison with 
stimulus-sampling theoreies (Restle, 
1959b) the strategy-selection model is 
like theories of the observing re- 
sponse (Atkinson, 1959a, 1959b; 
Wyckoff, 1952), except that in the 
strategy-selection model there is no 
conditioning, only observing (select- 
ing). The mathematical structure of 
the model is very close to that of 
Bower's (1960) one-element associa- 
tion model. 


SuMMARY 


A stochastic model for the solution 
of cue learning problems by the selec- 
tion of strategies was stated and de- 
veloped. Errors were shown to con- 
Stitute a system of uncertain recur- 
rent events in Feller's sense. Three 
models, one-strategy-at-a-time, all- 
Strategies-at-once, and  a-random- 
sample-of-strategies, were formulated 
and shown to yield the same system 
of recurrent events, to be identical 
in terms of data. This is the in- 
dependence-of-sample-size theorem. 
The basic distribution of the recur- 
rent-events system was used to gener- 
ate a description of the data, and also 
a method for estimating the propor- 
tions of correct and wrong strategies 
in a problem. Variations of the 
model, which take account of per- 
severation and complex strategies, 
were indicated. Experimental evi- 
dence, mainly on the additivity of 
cues, was discussed. 
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SUBJECTIVE ORGANIZATION IN FREE RECALL 
OF “UNRELATED” WORDS! 


ENDEL TULVING 


University of Toronto 


This paper is concerned with organ- 
ization as a dependent variable in free 
recall verbal learning. A method will 
be described for measuring the extent 
to which subjects’ recall of verbal 
items presented in different orders on 
successive trials is structured se- 
quentially. The method assumes no 
knowledge on the experimenter's part 
as to the sources of organization, and 
it is thus applicable to free recall 
learning of any list of verbal items. 
The effect of repetition on organiza- 
tion, and the relation of organization 
to amount of recall will be briefly 
examined in a simple experiment. 
The method and the data are dis- 
cussed in terms of the problem of the 
role of repetition in free recall verbal 
learning. 


VERBAL LEARNING AND 
ORGANIZATION 


The fact that learning occurs under 
conditions of practice, often appa- 
rently through mere repetition of the 
material, is empirically well known, 
but theoretically not yet fully under- 
stood. Indeed, the question of why 
repetition leads to better recall has 
not been raised too often explicitly, 
although it has been implicit in much 
experimental and theoretical work. 

The apparent lack of explicit inter- 
est in this problem seems to be related 
to the restricted aspects of subjects 
behavior which are studied in sen 
learning experiments. In a ie 
majority of studies the dependent 
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variables have been based on the 
operation of counting single unordered 
responses, assigned to categories such 
as "correct," “intralist intrusion," 
"remote backward association," and 
the like. Given but a single basic 
response variable of this kind, there 
is relatively little that can be done 
with the question of why repetition 
is effective, other than studying the 
effect of different independent vari- 
ables on the rate of learning. One 
can be justifiably sceptical, however, 
of our chances of ever understanding 
the basic process of verbal learning 
by simply counting "correct" and 
various classes of “incorrect” re- 
Sponses under a variety of experi- 
mental conditions. Bolles’ (1959) re- 
cent suggestion that, "If we are to 
understand human learning, we must 
determine what our Ss are really 
doing" (p. 580), can be regarded as 
an example of growing realization that 
a critical reappraisal of response vari- 
ables in verbal learning is overdue. 
An important contribution to our 
understanding of the basic acquisition 
process might well be provided by 
response variables based on ordered 
classifications of data (Miller & 
Frick, 1949). Such variables are 
particularly interesting in the light of 
G. A. Miller's (1956a, 1956b, 1956c), 
conception of verbal learning as a 
direct consequence of the process of re- 
coding or organization. Rehearsal or 
repetition, according to Miller, 1956b), 
has "the very important effect of or- 
ganizing many separate items into 
a single unit" (p. 43). Repetition 
does not change the basic storage 
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capacity of memory. Rather, organ- 
izing processes accompanying repeti- 
tion lead to an apparent increase in 
this capacity by increasing the in- 
formation load of individual units. 
Miller’s unitization hypothesis, if 
found tenable, has important impli- 
cations for research strategy and 
theory in verbal learning. One of 
the difficulties in evaluating the hy- 
pothesis lies in the lack of appropriate 
measures of organization. Organiza- 
tion as a response variable has been 
investigated by many psychologists 
(e.g., Bousfield, 1953; Cofer, 1959; 
Jenkins & Russell, 1952; Rothkopf 
& Coke, 1961), but its measurement 
has always depended on the experi- 
menter's knowledge of Sources of or- 
ganization present in the stimulus list. 
It is well known, however, that sub- 
jects are quite capable of memorizing 
materials which are not organized in 
any obvious manner, or for which the 
Sources of organization cannot be 
readily specified, Thus, if the con- 
Cept of organization is to have more 
general applicability, it seems neces- 
Sary to develop methods that permit 
Observation and quantification of or- 
ganization as a response measure in- 
dependent of the characteristics of the 
stimulus material. This is what the 
present paper attempts to do. 
Behavioral manifestations of the 
hypothesized organizing process can 
best be studied under conditions where 
the order in which the Subject recalls 
items is free to vary. Items which 
are organized into a single unit would 
then be expected to occur in close 
temporal contiguity in subject's re. 
call. Repeated occurrences of such 
Sequential patterns would indicate the 
existence of the organizing process, 
and the extent to which this occurs 
Would provide an estimate of the 
degree of organization. Thusa quan- 
titative analysis of sequential depend- 


encies among items in subject's free 
recall on successive trials would yield 
measures of a response variable closely 
related to the hypothetical organizing 
process, independently of quantity of 
recall. 

The method proposed in this paper 
is derived from information theory 
(Shannon & Weaver, 1949), and is 
quite similar to Miller and Frick's 
(1949) "index of behavioral stereo- 
typy." It provides for a measure of 
sequential redundancy in repeated 
ordered samples of a set of items. As 
the experimental paradigm used here 
involves presentation of material com- 
pletely free from any sequential re- 
dundancy, such redundancy or or- 
ganization in subject's recall cannot 
be attributed to the input material. 
Rather, it is the subject who imposes 
à certain degree of organization on the 
material. For this reason such or- 
ganization is called subjective organ- 
ization (SO). In information theory 
terms, SO is “noise” produced by the 
channel, that is the subject; it is in- 
formation in the output not found in 
the input (Miller, 1953). 


MEASUREMENT OF SUBJECTIVE 
ORGANIZATION 


Suppose we select L words, any 
words, and construct L different or- 
ders of these words such that each 
word appears in each of the Z serial 
Positions once and is preceded and 
followed by each other word just 
Once. This can be done easily if 
L +1 isa prime number. We then 
represent each of these L ordered lists 
to the subject on L separate trials. 
Words are shown one at a time on the 
memory drum. After each trial the 
Subject is asked to record all the words 
that he remembers from the list, in any 
order he wishes. After all L ordered 
lists have thus been presented we have 
L recall records from the subject, one 
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TABLE 1 
A SAMPLE RECALL MATRIX FOR AN INDIVIDUAL SUBJECT 
(2+1)th word 
nth word 

yo » ye y: y ys ye PLI ys Yo | y0 | yn | siz | yia | yu yis | y16 ni 
m i| 4) 2) 2) 1] 3 ou RR abl 16 
p 1 1 9| d 2 14 
Xa t| 2 1|.2 2 H a 12 
xs 23 1| 2| a) 2 2| 1 it 13 
Xa 1| 2 1 1| 5 3 TI 15 
Xs f 2 tb 1 2 15 
Xe PAS: 1 1 1 11 16 
x7 1 1 2 1 7 12 
Xs 2| 2 3 2 1 4 14 
Xo 1 t| 2 10 1] 15 
X10 E i| Vi 3 1 1| 2 T 13 
xu 1 6| ti 2 4 14 
“12 2| X 2 " dc S e 13 
Xi 2 9 Rz 1 1 16 
xu i 9 1 2| 1 14 
Xis 2 bl Sw Hi sS 1 14 
X16 1 2 ig 1 10 
ni 16 | 14 | 12 | 13 | 15 | 15 | 16 | 12 | 14 | 15 | 13 | 14 | 13 | 16 | 14 | 14 | 10 | N = 236 


Note.—Recall is pooled over 16 trials. 


for each trial. We then tabulate the 
frequency of all pairs of adjacent recall 
responses in a matrix consisting of 
L + 1 rows and L + 1 columns, and 
compute the measure of SO from the 
data in the matrix. 

Table 1 shows a sample recall (out- 
put) matrix for a case where L — 16 
and where the subjects recall re- 
sponses are pooled for all 16 trials. 
Symbols xi, Xe . . . xis. identifying 
the rows refer to the 1st, 2nd, . . . 
16th word in the zth position in the 
subject's recall list; symbols yı, yə 
. . . Yı identifying the columns stand 
for the same words in the (n + 1)th 
position in the subject's recall list. 
Thus x and y represent successive 
positions in the subject's recall, and 
subscripts identify the contents of 
these positions, i.e., words. , Symbol 
xo stands for the blank mend pe 
word) immediately preceding the nt 
word in the subject's recall list, an 


Yo refers to the blank position im- 
mediately following the last word in 
the same list. The entries in the 
cells of the matrix show the frequency 
with which word 7 was followed by 
word j in the subject's recall on all 16 
trials; their numerical values are 
symbolized as z;; Entries in the 
row labelled x9 show the frequencies 
with which each word j stood at the 
beginning of the subject's recall lists 
on 16 trials, and similarly entries in 
the column ys show frequencies with 
which each word i was in the last 
position of the recall lists on these 
trials. Zeros have been omitted in 
the matrix. For instance, so = 3, 
which means that Word Number 6 
was in the first position (followed no 
other word) in the subject's recall on 
three trials; #ı3 = 0, indicating that 
Word Number 3 never followed Word 
Number 1 during the 16 trials, and so 
forth. The numerical values of mar- 
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ginal totals of rows, z;, show how 
many times each word 7 appeared in 
the subject's recall on all 16 trials. 
Marginal totals for corresponding 
rows and columns are of course identi- 
cal. It is also to be noted that there 
are.no entries in the cells along the 
main diagonal, from upper left to 
lower right, since normally no word 
follows itself in recall. 


Given a recall matrix such as the 
one in Table 1, a number of different 
measures of second-order sequential 
organization could be used. The 
measure of sequential organization 
adopted in the present Paper is essenti- 
ally a measure of redundancy in a 
sequence of events when the probabil- 
ities of successive pairs of events are 
known or can be estimated : 


7 Hi(y) 
peste i 


HQ) here refers to the amount of 
information (uncertainty) in the event 
y when the preceding event x is known, 
and max H,(y) stands for the maxi- 
mum value of such uncertainty. 
This is Miller and Frick's (1949) 
second-order index of behavioral stere- 
Otypy. The formula for computing 
Second-order SO involves a minor 
modification of the above formula,? 
simplifying calculations in addition 


2 Max IH.(y) in Miller and Frick's index of 
behavioral Stereotypy is defined as max 
H(x,y) — max A(x). Thus it is based on 
the assumption that all responses are equally 
frequent. For the present problem, however 
it may be somewhat more meaningful to com. 
pute max’ H;(y) on the basis of the given fre- 
quency of individual responses, i.e., as deter- 
mined by subject’s recall: 


max' H,(y) = max H (x,y) — H (x) 


If max' H.(y) is substituted for max Hy) 
in Formula 1, that formula becomes mathe- 
matically equivalent with Formula 2, the 
measure of subjective organization as used in 
this paper, 


- 


to being somewhat more appropriate 
to the free recall learning situation. 
This formula is as follows: 


È nis log 1;;—min X z1log ni; 
ij ij 
~ Yn; log 1;—min 55 ni; log 1 
i ái [2] 
where ;/;; represents the numerical 
value of the cell in the ith row 
and jth column, z; represents the 
marginal total of the ith row, and 
min Ð 71; log 7;; represents the mini- 


SO 


£3 
mum value that 5; zi; log zi; can 
hj 
assume. 


In cases where no marginal total 
n; exceeds the length of the list L, 
min 25 2;; log nz is equal to 0, and 


ij ; 
Formula 2 reduces to the following 
relatively simple expression : 


Dd ni; log niz 


i 
a D n; log ni [3] 


In this formula Y; n; log n; can be 


i 
regarded as a measure of maximum 
organization, 57 z1;; log n:; represents 


LJ 

the actual organization, and SO is 
simply a measure of actual organiza- 
tion relative to the maximum. Thus 
SO can assume all values between zero 
and unity, the former expressing the 
complete absence of second-order se- 
quential organization, as in the case 
of the ordered stimulus lists presented. 
to the subject, and the latter the 
maximum degree of such organiza- 
tion, as in the case of an imaginary 
subject who recalls all words on all 
trials exactly in the same order. SO 
for the data in Table 1, incidentally, 
is .397. 

So far we have considered only 
Second-order sequential organization, 


348 ENDEL TULVING 


based on pairs of successive responses. 
The method can be easily extended 
to higher-order dependencies, even 
though the computational labors in- 
volved would become quite prohibi- 
tive. However, estimates of higher- 
order organization can be obtained 
more easily. For instance, instead of 
setting up a three dimensional matrix 
for calculating third-order SO, we 
can tabulate data in a two dimensional 
matrix with nth items as rows and 
(n + 2)th items as columns. In this 
case we also need two rows for blank 
spaces preceding the first word (xo 
and xo-1), and two columns for blank 
spaces following the last word in each 
recall list (yo and yo,1). This proce- 
dure amounts to collapsing the three 
dimensional matrix along the (n+ 1)th 
dimension. That is, we are comput- 
ing an SO measure based on pairs of 
responses separated by one response. 
Such an SO is labeled SO (Lag 1), 
and it does provide an estimate of the 
third-order SO. In a similar manner, 
still higher order dependencies can 
be evaluated. SO (Lag 2) is an esti- 
mate of the fourth-order sequential 
organization, SO (Lag 3) is an esti- 
mate of the fifth-order SO, and so 
forth. 

SO can be calculated for any block 
of trials, usually successive trials. 
Thus it is sometimes useful to indicate 
the size of the block on which the 
measure is based. For instance, SO 
(Blks — 3) means SO calculated from 
blocks of three successive trials. SO 
for a single trial is not meaningful, 
as subjective organization is defined 
in terms of the subject's tendency 
to recall words in the same order on 
successive trials; in fact SO for a 
single trial, when calculated according 
to the formulas given, turns out to be 
0, provided that the subject has not 


repeated any pairs of words. 


DEMONSTRATION EXPERIMENT: 
SUBJECTIVE ORGANIZATION AND 
PERFORMANCE 


The following simple experiment 
will serve as an illustration of how the 
method works, and the data from the 
experiment will provide tentative 
answers to three questions which can 
be raised in the context of the dis- 
cussion of the concept of organization 
in the first section. The questions 
are as follows: First, do the subjects 
actually organize "unrelated" words 
when instructions are simply to recall 
as many words as they can? Second, 
does this subjective organization , if 
found in the data, increase with repe- 
tition of the material? Third, is there 
a systematic relation between sub- 
jective organization and more tradi- 
tional measures of learning, such as 
the number of words correctly re- 
called ? 


Procedure 


A list of 16 English words was adopted from 
another verbal learning experiment (List III 
in Tulving & Thornton, 1959). All words 
were disyllabic nouns, consisting of five, six, 
or seven letters. The words, here listed 
alphabetically, were as follows: ACCENT, BAR- 
RACK, DRUMLIN, FINDING, GARDEN, HOYDEN, 
ISSUE, JUNGLE, LAGOON, MAXIM, OFFICE, 
POMADE, QUILLET, TREASON, VALLEY, WALKER. 
Sixteen different sequences of these words 
were constructed such that, considering any 
block or all 16 orders of the stimulus list, 
there was no second-order or higher-order 
redundancy in the lists. 

Sixteen female undergradute students en- 
rolled in the introductory psychology courses 
at the University of Toronto served as sub- 
jects. Their median age was 19 years. None 
of the subjects had previously participated in 
any verbal learning experiments. 

Each subject was tested individually. In 
the instructions the subject was told that her 
task was to learn a list of 16 words, two-syl- 
lable English nouns, that would be presented 
to her on 16 separate trials. At the end of each 
trial she was to write down as many words 
from the list as she remembered. ‘Then she 
would be shown the same words again, in à 
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different order, and at the end of the trial she 
would again write down all the words that she 
was able to recall. The subject was also told 
that the order in which she recalled the words 
did not matter, her task was simply to recall 
as many of the words on each trial as she 
could. 

The 16 different orders of the stimulus list 
were typed on white paper in lower case letters 
and presented to the subject on a memory 
drum. The rate of exposure was one word 
per second, At the end of the trial the sub- 
ject was given 90 seconds to record her recall 
on sheets of paper lined with 16 consecutively 
numbered lines, 


The order in which the 16 ordered lists were 
presented to the subjects was systematically 
counterbalanced among subjects. The 
method used for assigning the 16 orders of the 
stimulus list to the 16 subjects on the 16 trials 
was the same as that used in assigning the 16 
words to 16 serial positions in 16 orders of the 
stimulus list. 


Results 


Two kinds of data are of interest in 
this experiment, performance (P) 
defined in terms of frequency of cor- 
rect recall, and subjective organiza- 
tion (SO). Extralist intrusions and 
misspelled words from the list were 
ignored in computing both P and SO 
Scores? Repetitions of list words 
within a given trial were included in 
the recall matrix and thus entered 
the SO score, but not the P Score. 
The mean number of extralist intru- 
sions, together with misspelled words, 
was 0.13 per trial, the mean number 
of repetitions of list words was 0.09 
per trial. 

Second-order SO scores were c 
lated for successive blocks of two 
trials (SO, Blks — 2), for running 
blocks of three trials (SO, Blks — 3), 
and for the total block of 16 trials 


alcu- 


3 Such extralist intrusions are of interest in 
their own right and may provide valuable 
information about the organizing process, 
In order to handle them in the present situ- 
ation, the recall matrix can be simply extended, 
However, such intrusions can probably be 
better studied in different experimental situ- 
ations (e.g., Deese, 1959). 
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Fic. 1. Mean performance (upper curve) 
and mean SO (lower curve) as a function of 
trials. (Values of performance are to be read 
from the left ordinate, SO from the right 
ordinate. Mean SO for blocks of two trials 
from 16 statistical subjects are shown for 
comparison, open triangles. For further 
explanation see the text.) 


(SO, Blks = 16). SO (Blks = 2) 
Scores were also computed for 16 
statistical subjects.* 

The effect of repetition on both P 
and SO is summarized graphically in 
Figure 1. The values of P are to be 
read from the left, the values of SO 
from the right hand ordinate. The 
abscissa represents trials. The upper 
curve is the typical learning curve, 
showing performance as a function of 
trials. The lower curve, fitted by 
inspection to data points representing 
SO (Blks — 3), could be called an 


4 IBM Model 650 Electronic Data Process- 
ing Machine was used in these calculations. 
Thanks are due to Albert S. Bregman for 
constructing the program. Data from statis- 
tical subjects, whose ‘performance’ was 
matched with that of individual experimental 
subjects, were collected as an attempt to 
estimate the amount of organization occurr- 
ing by chance, since the sampling distribution 
of SO, under the null hypothesis of no or- 
ganization, is not known. 
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"organization" curve. The SO for 
each block. is plotted against the 
middle trial of the block. Thus there 
are 14 SO (Blks — 3) scores. SO 
scores for successive blocks of size 
two are also shown for both the real 
and the statistical subjects, plotted 
against the midpoints of the trial 
blocks. 

These data clearly demonstrate 
that subjects do in fact organize their 
recall sequentially even in the absence 
of such sequential organization in 
stimulus lists, and that this organiza- 
tion increases systematically with 
repeated exposures to, and recall of 
the material. Thus, in an experi- 
mental situation such as the present 
one, repetition has two parallel effects : 
increasing frequency of recall is ac- 
companied by an increasingly tighter 
sequential organization. 

The slopes of the two curves de- 
picted in Figure 1 seem to be different. 
When mean P scores are plotted 
against mean SO (Blks — 3) for the 
14 trials, the curvilinearity of the rela- 
tion is obvious. However, the rela- 
tion becomes quite linear when a 
logarithmic transformation is applied 
to the mean SO scores, suggesting 
that, as a first approximation, increase 
in performance is proportional to in- 
crease in log SO. The product-mo- 
ment correlation between mean P 
and log mean SO (Blks — 3) for 
Trials 2 to 15 was found to be +.96, 

Estimates of higher-order SO were 


TABLE 2 


MEANS AND STANDARD DEVIATIONS OF SO 
(Blks = 16) ron VARIOUS Lacs 


Group Lag 0| Lag 1| Lag 2| Lag 3 


Experimental subjects 


Mean .224 | .223 | .216 


i) 
© 
L3 


Standard deviation | 1054 | .045 | .034 | .035 
Statistical subjects 1 
“Mean : .180 16 

Standard deviation O17 l 


Note.—N = 16 for each group. 


Enver TurviNG 


TABLE 3 


INTERCORRELATIONS AMONG SO (Blks— 16). 
MEASURES ron Lacs 0, 1, 2, AND 3; AND 
CoRRELATIONS BETWEEN THESE SO | 
SconEs AND MEAN P SCORES ON 
TRIALS 1 TO 8, 9, ro 16, AND | 
1 TO 16 FOR INDIVIDUAL 


SUBJECTS 
so 
Measures 

Lag 0 | Lag 1 | Lag 2 | Lag 3 
SO (Lag 1) +.86 
SO (Lag 2) +.62 | 4-85 
SO (Lag 3) +.58 | +.74 | +.84 
P (Trials 1 to 8) 45 32 | 4-24 | 4-01 
P (Trials 9 to 16) | +.78 nr +.34 | +19 
P (Trials 1 to 16) | +.63 | -+47 | 4-30 | +:08 


Note.—N = 16, 


obtained in the form of SO scores for 
Lags 1, 2, and 3, for the total block of 
16 trials. The mean SO (Blks = 16) 
scores for these lags and their standard 
deviations are shown in Table 2, 
together with comparable data from 
the statistical subjects for Lags 0 and 
3. Even though mean SO scores 
seem to decrease with increasing lags, 
it is interesting to note that SO (Lag 3, 
Blks = 16) scores are higher than the 
same scores from the statistical sub- 
jects. The median test yielded a 
chi-square of 15.12, which is highly 
significant.’ 

The intercorrelations among SO 
(Blks = 16) scores for Lags 0, 1, 2, 
and 3, as well as correlations between 
these SO scores and several average 
P measures are shown in Table 3. 
These correlations are based on indi- 
vidual data from 16 subjects. 

Certain orderly relations appear in 
Table 3. First, the correlations be- 
tween any two adjacent SO measures 
on the lag dimension are reasonably 
high, all three listed in the table being 
approximately +.85. Second, with 
increasing distance on the lag dimen- 
sion between any two SO measures 
the correlations decrease. Thus, for 
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example, SO (Lag 0) has a correlation 
of +.85 with that for Lag 1, +.62 
with Lag 2, and +.58 with Lag 3. 
"Third, correlations between P and 
SO decrease with increase in the lag of 
the SO measure. The most import- 
ant conclusion to be drawn from these 
data is that, within the limits of the 
present method, the second-order SO 
is about as useful a measure of se- 
quential organization as that based 
on any combination of different 
orders, and more useful than any 
other single higher-order SO measure. 

Positive correlations between SO 
and P scores in Table 3, where correla- 
tion is done on data from 16 subjects, 
support the observation of the same 
relation over trials. It is also inter- 
esting to note that on the basis of a 
single estimate of organization for 
each subject, that based on the total 
block of 16 trials, SO accounts for a 
larger amount of variance in the P 
variable for Trials 9 to 16 than for 
Trials 1 to 8. 

-All the above findings relate to 
organization as an intrasubject phe- 
nomenon. Commonality of organ- 
ization between subjects, however, can 
also be investigated by using the same 
general method. That there might 
be such commonality became appar- 
ent in the inspection of recall data 
from different subjects. Recall ma- 
trices from individual subjects showed 
that quite often the patterning of re- 
sponse sequences was similar for many 
subjects. This phenomenon was in- 
vestigated in two ways. 

First, the recall data from all 16 
subjects were pooled for all 16 trials 
and entered into an "'intersubject re- 
call matrix." Ifall subjects organized 
their recall differently, the distribu- 
tion of cell entries in the group recall 
matrix would not be significantly 
different from the distribution under 
conditions where only chance factors 
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Fic. 2. Mean intersubject SO scores for 
16 subjects on eight successive blocks of two 
trials. 


are operating. To test the signifi- 
cance of the apparent deviation of the 
data in this matrix from chance dis- 
tribution, the information function 
T(x,y) was computed. It was found 
to be 0.352 bits which is significant at 
better than .01 level, using the method 
suggested by Miller (1955). This 
finding thus confirms the casual ob- 
servation that there is a certain degree 
of commonality in the subject's recall. 

The second method used to explore 
intersubject organization was as fol- 
lows. For a single trial data were 
pooled in a recall matrix from all 16 
subjects. For each such intersubject 
recall matrix SO was computed. This 
procedure results in 16 intersubject 
SO measures, one for each trial. 
These data are shown in Figure 2. 
Intersubject SO measures have been 
averaged for blocks of two trials, in 
order to smooth the curve. In view 
of the orderly increase in the intersub- 
ject SO measure with repetition, it 
would seem safe to conclude that the 
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commonality of organized recall se- 
quences between subjects increases 
with repetition of the material. 


DISCUSSION 


All three questions posed at the 
beginning of the experiment have been 
answered in the affirmative. It seems 
that the subjects do impose a se- 
quential structure on their recall, that 
this subjective organization increases 
with repeated exposures and recall 
of the material, and that there is a 
positive correlation between organ- 
ization and performance. None of 
these findings is very surprising, and 
all are in good agreement with related 
experimental evidence. Bousfield 
and his associates, as well as other 
investigators, have amply demon- 
strated that there is a strong tendency 
in subjects to recall randomly pre- 
sented material in sequences of re- 
lated words or clusters (Bousfield, 
1953; Cohen & Bousfield, 1956; Jen- 
kins & Russell, 1952). It has also 
been shown that repeated presenta- 
tions of the stimulus word list in- 
creases this clustering tendency (Bous- 
field & Cohen, 1953), and that there 
is a direct relation between the degree 
of clustering and the amount of recall 
(Bousfield & Cohen, 1955; Bousfield, 
Cohen, & Whitmarsh, 1958; Jenkins, 
Mink, & Russell, 1958; Sakoda, 1956). 
The findings of the present experiment 
extend the domain of these phenomena 
of organization from one trial recall 
and experimentally organized ma- 
terials, as used in all the above ex- 
periments, to a learning situation, 
with several successive trials, and 
experimentally unorganized materials, 

The finding that the organizing 
effects, well demonstrated in experi- 
ments on clustering, can be experi- 
mentally assessed even in case of un- 
related words, is quite encouraging. 
Perhaps paradoxically this suggests 


that a list of completely unrelated 
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words is probably as fictional as is a 
truly nonsensical nonsense syllable. 
It is for this reason that reference has 
been made throughout this paper to 
"unrelated" words, referring only to 
the fact that words are experimentally 
unselected as to their meaning. 

The present method of quantifying 
sequential organization in free recall 
learning constitutes only a rather 
coarse net for capturing the process of 
organization in all its diversity. For 
one thing, the SO measure is most 
sensitive to rigid sequences of re- 
Sponses, or chunks (Miller, 1956c). 
It is quite likely, however, that a great 
deal of organization occurs in the form 
of clusters (Bousfield, 1953), in which 
items have no fixed order. This 
would tend not only to attenuate 
measures of SO, but also, through 
increased variability of SO, depress 
correlations with the performance 
variable. 

It would be interesting, in view of 
Miller's unitization hypothesis, to 
look at organization in terms of its 
units. In many cases subjects un- 
doubtedly impose a hierarchical or- 
ganization on recall, or use a Plan 
(Miller, Galanter, & Pribram, 1960). 
In other cases, particularly when the 
list to be learned is short and some 
coding device such as the alphabet 
can be employed, nonhierarchical or- 
ganization may also occur. The pres- 
ent method does not lend itself to 
an identification of various strategies 
adopted by subjects, even though 
casual inspection suggests many di- 
Verse sources of organization: associ- 
ative grouping, conceptual categories, 
assonance, grouping in terms of famil- 
larity of items, and so forth. 

The method provides only an over- 
all measure of organization. This 
measure is related to the size of the 
organizing units in a systematic man- 
ner, but the relation cannot be easily 
specified. The two extremes of the 
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function, however, are determined by 
the definition of the SO measure, and 
the transition from one to the other 
must be gradual. In one case the 
units of organization are individual 
items in the list, resulting from previ- 
ous response integration (Mandler, 
1954) outside the laboratory. In this 
case the probability of two or more 
items being recalled consistently to- 
gether is no greater than what might 
be expected by chance, and the nu- 
merical value of SO is approximately 
the same as that of statistical sub- 
jects. On the other extreme, the 
whole set of items is organized into 
one unit, and SO is at the maximum. 

Assuming a systematic relation be- 
tween the size of the organizing units 
and the numerical value of SO, the 
finding that the second-order SO ac- 
counts for a larger proportion of vari- 
ance in individual P scores than do 
estimates of higher-order measures 
makes good sense. Some units of 
organization contain only two items, 
and, given a flexible order of units in 
recall, these are not tapped by the 
third-order and higher-order measures 
of SO, while the second-order measure 
is sensitive to all units of two or more 
Items. This interpretation is also 
consistent with the observed inter- 
correlations among SO measures for 
different lags. Although these meas- 
ures are all based on identical se- 
quences of responses, the correlations 
are less than perfect. This probably 
reflects the fact that distributions of 
size of organizing units vary from sub- 
ject to subject. 

What is the significance of the 
method and the preliminary findings 
for the question originally posed about 
the effects of repetition on perform- 
ance? It seems that response vari- 
ables such as SO, based on ordered 
classifications of data, might be quite 
useful in our attempts to shed more 
light on the nature of the acquisition 


process in verbal learning. And the 
present findings seem to have con- 
tributed to the attractiveness of 
Miller’s unitization hypothesis (1956a) 
as a serious beginning of a useful 
theory of free recall verbal learning. 
Even though intuitively there seems 
little doubt that performance de- 
pends on organization, the correla- 
tional design used in the present study 
does not yet permit such a conclusion, 
and other kinds of experiments are 
necessary. Given a method of quan- 
tifying behavioral manifestations of 
the organizing process, it is quite 
possible to test many implications of 
the unitization hypothesis. 

It is to be noted that the concept of 
repetition has been specified here 
rather loosely. There are at least 
two operationally distinguishable 
phases which should be studied sep- 
arately, presentation of material and 
test for recall. A conceptual and ex- 
perimental clarification of this im- 
portant concept is clearly needed. 

The observation of common recall 
sequences among different subjects 
and of increased stereotypy of such 
intersubject organization under con- 
ditions of practice very strongly sug- 
gests, or rather confirms the expecta- 
tion, that sources of organization are 
discovered by subjects in the material, 
rather than invented idiosyncrati- 
cally. Many task variables, therefore, 
are expected to influence both subjec- 
tive organization and recall of the 
material, and a systematic exploration 
of these relations constitutes an im- 
portant part of the research program 
designed to evaluate the tenability of 
the unitization hypothesis. 


SUMMARY 


A method for examining and quanti- 
fying sequential dependencies in the 
subjects’ free recall of words on suc- 
cessive recall trials has been presented. 
Subjective organization as a depend- 
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ent variable was defined in terms of 
the subject's tendency to recall items 
in the same order on different trials in 
the absence of any experimentally 
manipulated sequential organization 
among items in the stimulus list. In 
a preliminary experiment it was found 
that the subject's recall behavior 
manifests such subjective organiza- 
tion, that this organization increases 
with repetition, and that there is a 
positive correlation between organ- 
ization and performance. These data 
were discussed with reference to the 
problem of the role of repetition in 
free recall verbal learning. 
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THE ACQUISITION OF KNOWLEDGE' 
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The growing interest in autoin- 
structional devices and their component 
learning programs has had the effect 
of focusing attention on what may be 
called “productive learning.” By this 
phrase is meant the kind of change in 
human behavior which permits the in- 
dividual to perform successfully on an 
entire class of specific tasks, rather 
than simply on one member of the 
class. Self-instructional programs are 
designed to ensure the acquisition of 
capabilities of performing classes of 
tasks implied by names like "binary 
numbers," “musical notation," "solving 
linear equations" rather than tasks re- 
quiring the reproduction of particular 
responses. 

When viewed in this manner, learn- 
ing programming is not seen simply as 
a technological development incorpo- 
rating previously established learning 
principles, but rather as one particular 
form of the ordering of stimulus and 
response events designed to bring 
about productive learning. It should 
be possible to study such learning, and 
the conditions which affect it, by the 
use of any of a variety of teaching ma- 
chines, although there are few studies 
of this sort in the current literature 
(cf. Lumsdaine & Glaser, 1960). In 
the laboratory, the usual form taken by 
studies of productive learning has been 
primarily that of the effects of instruc- 
tions and pretraining on problem soly- 
ing (eg., Hilgard, Irvine, & Whipple, 


1This study was made possible in part by 
funds granted by the Carnegie Corporation 
of New York. The opinions expressed are 
those of the author, and do not necessarily 
reflect the views of that Corporation. 


1953; Katona, 1940; Maltzman, Eis- 
man, Brooks, & Smith, 1956). 

When an individual is subjected to 
the situation represented by a learning 
program, his performance may change, 
and the experimenter then infers that 
he has acquired a new capability. It 
would not be adequate to say merely 
that he has acquired new "responses," 
since one cannot identify the specific 
responses involved. (Adding frac- 
tions, for example, could be repre- 
sented by any of an infinite number of 
distinguishable stimulus situations, and 
an equal number of responses.) Since 
we need to have a term by means of 
which to refer to what is acquired as 
a result of responding correctly to a 
learning program, we may as well use 
the term "knowledge." By definition, 
"knowledge" is that inferred capability 
which makes possible the successful 
performance of a class of tasks that 
could not be performed before the 
learning was undertaken. 


Some Initial Observations 


In a previous study of programmed 
learning (Gagné & Brown, 1961) sev- 
eral kinds of learning programs were 
used in the attempt to establish the 
performance, in high school boys, of 
deriving formulas for the sum of n 
terms in a number series. Addi- 
tional observations with this material 
led us to the following formulation : 
In productive learning, we are dealing 
with two major categories of variables. 
The first of these is knowledge, that 
is, the capabilities the individual pos- 
sesses at any given stage in the learn- 
ing; while the second is instructions, 
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the content of the communications pre- 
sented within the frames of a learning 
program. 

In considering further the knowl- 
edge category, it has been found pos- 
sible to identify this class of variable 
more comprehensively in the following 
way: Beginning with the final task, the 
question is asked, What kind of capa- 
bility would an individual have to 
possess if he were able to perform this 
task successfully, were we to give 
him only instructions? The answer 
to this question, it turns out, identifies 
a new class of task which appears to 
have several important characteristics. 
Although it is conceived as an internal 
"disposition," it is directly measurable 
as a performance. Yet it is not the 
same performance as the final task 
from which it was derived. It is in 
some sense simpler, and it is also more 
general. In other words, it appears 
that what we have defined by this pro- 
cedure is an entity of "subordinate 
knowledge" which is essential to the 
performance of the more specific final 
task. 

Having done this, it was natural to 
think next of repeating the procedure 
with this newly defined entity (task). 
What would the individual have to 
know in order to be capable of doing 
this task without undertaking any 
learning, but given only some instruc- 
tions? This time it seemed evident 
that there were two entities of sub- 
ordinate knowledge which combined in 
support of the task. Continuing to 
follow this procedure, we found that 
what we were defining was a hierarchy 
of subordinate knowledges, growing in- 
creasingly "simple," and at the same 
time increasingly general as the defin- 
ing process continued. : . 

By means of this systematic analysis, 
it was possible to identify nine sepa- 
rate entities of subordinate knowledge, 
arranged in hierarchical pmi [o 

Figure 1). Generally stated, 
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hypothesis was that (a) no individual 
could perform the final task without 
having these subordinate capabilities 
(ie, without being able to per- 
form these simpler and more general 
tasks) ; and (b) that any superordinate 
task in the hierarchy could be per- 
formed by an individual provided 
suitable instructions were given, and 
provided the relevant subordinate 
knowledges could be recalled by him. 

It may be noted that there are some 
possible resemblances between the enti- 
ties of such a knowledge hierarchy and 
the hypothetical constructs described 
by three other writers. First are the 
habit-family hierarchies of Maltzman 
(1955), which are conceived to medi- 
ate problem solving, and are aroused 
by instructions (Maltzman et al, 
1956). The second are the “organiza- 
tions" proposed by Katona (1940), 
which are considered to be combined 
by the learner into new knowledge 
after receiving certain kinds of instruc- 
tions, without repetitive practice. The 
third is Harlow’s (1949) concept of 
learning set. Harlow's monkeys ac- 
quired a general capability of success- 
fully performing a class of tasks, such 
as oddity problems, and accordingly 
are said to have acquired a learning 
set. There is also the suggestion in 
one of Harlow's (Harlow & Harlow, 
1949) reports that there may be a 
hierarchical arrangement of tasks more 
complex than oddity problems which 
monkeys can successfully perform. 
Since we think it important to imply 
a continuity between the relatively 
complex performances described here 
and the simpler ones performed by 
monkeys, we are inclined to refer to 
these subordinate capabilities as “learn- 
ing sets.” 


REQUIREMENTS OF THEORY 


If there is to be a theory of produc- 
tive learning, it evidently must deal 
with the independent variables that can 
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be identified in the two major cate- 
gories of instructions and subordinate 
capabilities, as well as with their inter- 
actions, in bringing about changes in 
human performance. 


Instructions 


Within a learning program, instruc- 
tions generally take the form of sen- 
tences which communicate something 
to the learner. It seems possible to 
think of such “communication” as 
being carried out with animals lower 
than man, by means of quite a dif- 
ferent set of experimental operations. 
Because of these communications, the 
human learner progresses from a point 
in the learning sequence at which he 
can perform one set of tasks to a point 
at which he achieves, for the first time, 
a higher level learning set (class of 
tasks). What functions must a theory 
of knowledge acquisition account for, 
if it is to encompass the effects of in- 
structions? The following paragraphs 
will attempt to describe these func- 
tions, not necessarily in order of im- 
portance. i 

First, instructions make it possible 
for the learner to identify the required 
terminal performance (for any given 
learning set). In educational terms, it 
might be said that they "define the 
goal" For example, if the task is 
adding fractions, it may be necessary 
for the learner to identify 1534 as an 
adequate answer, and 934 as an inade- 
quate one. 

Second, instructions bring about 
proper identifications of the elements 
of the stimulus situation. For ex- 
ample, suppose that problems are to 
be presented using the word "fraction." 
The learner must be able to identify 27 
as a fraction and .4 as not a fraction, 
Or, he may have to identify X as “sum 
of," and n as "number." Usually, in- 
structions establish such identifications 
in a very few repetitions, and some- 
times in a single trial. If there are 


many of them, differentiation may re- 
quire several repetitions involving con- 
trasting feedback for right and wrong 
responses. : 
A third function of instructions is 
to establish high recallability of learn- 
ing sets. The most obviously manipu- 
lable way to do this is by repetition. 
However it should be noted that repeti- 
tion has a particular meaning in this 
context, It is not exact repetition of 
a stimulus situation (as in reproductive 
learning), but rather the presentation 
of additional examples of a class of 
tasks. Typically, within a learning 
program, a task representing a particu- 
lar learning set is achieved once, for 
the first time. This may then be fol- 
lowed by instructions which present 
one or more additional examples of this 
same class of task. “Variety” in such 
repetition (meaning variety in the 
stimulus context) may be an important 
subvariable in affecting recallability. 
Instructions having the function of es- 
tablishing high recallability for learn- 
ing sets may demand "recall," as in the 
instances cited, or they may on other 
occasions attempt to achieve this effect 
by "recognition" (ie. not requiring 
the learner to produce an answer). 
The fourth function of instructions 
is perhaps the most interesting from’ 
the standpoint of the questions it 
raises for research. This is the 
"guidance of thinking," concerning 
whose operation there is only a small 
amount of evidence (cf. Duncan, 1959). 
Once the subordinate learning sets 
have been recalled, instructions are 
used to promote their application to 
(or perhaps “integration into ) the 
performance of a task that is entirely 
new so far as the learner is concerned. 
Ata minimum, this function of instruc- 
tions may be provided by a statement 
like “Now put these ideas together to 
solve this problem;" possibly this 
amounts to an attempt to establish a 
set. Beyond this, thinking may be 
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guided by suggestions which progres- 
sively limit the range of hypotheses 
entertained by the learner, in such a 
way as to decrease the number of in- 
correct solutions he considers (cf. 
Gagné & Brown, 1961; Katona, 1940). 
Within a typical learning program, 
guidance of thinking is employed after 
identification of terminal performance 
and of stimulus elements have been 
completed, and after high recallability 
of relevant learning sets has been en- 
sured. In common sense terms, the 
purpose of these instructions is to sug- 
gest to the learner “how to approach 
the solution of a new task” without, 
however, “telling him the answer.” 

Obviously, much more is needed to 
be known about the effects of this vari- 
able, if indeed it is a single variable, 
Initially, it might be noted that guid- 
ance of thinking can vary in amount ; 
that is, one can design a set of instruc- 
tions which say no more than “now do 
this new task" (a minimal amount) ; 
or, at the other end of the scale, a set 
of instructions which in effect suggest 
a step-by-step procedure for using pre- 
viously acquired learning sets in a new 
situation. 


Subordinate 
Sets 


: When one begins with the perform- 
ance of a particular class of tasks as 
a criterion of terminal behavior, it is 
possible to identify the subordinate 
learning sets required by means of the 
procedure previously described. The 
question may be stated more exactly 
as, "What would the individual have 
to be able to do in order that he can 
attain successful performance on this 
task, provided he is given only instruc- 
tions?” This question is then applied 
successively to the subordinate classes 
of tasks identified by the answer, 
“What he would have to be able to do” 
is in each case one or more perform- 
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ances which constitute the denotative 
definitions of learning sets for particu- 
lar classes of tasks, and totally for the 
entire knowledge hierarchy. 

A theory of knowledge acquisition 
must propose some manner of func- 
tioning for the learning sets in a 
hierarchy. A good possibility seems 
to be that they are mediators of posi- 
tive trausfer from lower-level learning 
Sets to higher-level tasks. The hy- 
pothesis is proposed that Specific trans- 
fer from one learning set to another 
standing above it in the hierarchy will 
be zero if the lower one cannot be 
recalled, and will range up to 100% if 
it can be, 

Tn narrative form, the action of the 
two classes of variables in the acquisi- 
tion of knowledge is conceived in the 
following way, A human learner be- 
gins the acquisition of the capability of 
performing a particular class of tasks 
with an individual array of relevant 
learning sets, Previously acquired. He 
then acquires new learning sets at pro- 
gressively higher levels of the knowl- 
edge hierarchy until the final class of 
tasks is achieved, Attaining each new 
learning set depends upon a process of 
Positive transfer, which is dependent 
upon (a) the recall of relevant sub- 


ordinate learning sets, and upon (D) 
the effects of instructions, 
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-TASK- 


Finding formu 


TIA. Using symbolo to identify 
spatial rolationshipa between 
numbers in different rowa and 
columna of a table 


IVA. Using symbole 
to locate vertical 
ton and identity 

in columna of 


subacripts to locate 
and identify numbero 
in rows of tables 


Symbol 
Recognition 


» for sua of 
m terms in a nusbcr series 


1. Supplying symbols and operationa 
for general cquations between 

numerical quantities having particular 
spatial relationa in a table 


1VA2. Using ayabolo 1VAB. Identifying 

spatial patterna of 
symbols and numbers 
in a table 


Patterno 
(Closure) 


11D. Supplying numbera and 
operations for specific equations 
between numbers having particular 
spatial relations in a table 


1110. Supplying numbers and 
for specific 
ween numerical 


Ivh2. Supplying 
minning digits in 
specific atate- 

mento of numerical 
equality 


Ivai. Supplying 
operationa to make 
equalities for opecific 
mumorical statements 


Fic. l. Hierarchy of knowledge for the task of finding formulas for the 
for the sum of » terms in a number series. 


be discussed later.) Each of these 
Subordinate knowledges can be repre- 


Sented as a class of task to be per- 
formed, 


Measuring Initial Patterns o 
Sets 


It is predicted that the presence of 
different patterns of learning sets can 
be determined for individuals who are 
unable to perform a final task such as 
the one under consideration. To test 
this, we administered a series of test 
items to a number of ninth-grade 
boys. These items were presented on 
4" X 6" cards, and the answers were 
written on specially prepared answer 
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sheets. This particular method was 
used in order to make testing continu- 
ous with the administration of a learn- 
ing program to be described hereafter. 
Each test item was carefully prepared 
to include instructions having the 
function of identification of terminal 
performance and of elements of the 
stimulus situation. 

Beginning with the final task, the 
items were arranged to be presented in 
the order I, IIA, IVA, IVA2, IVAB, 
IIB, IIIB, IVB1, and IVB2. For any 
given subject, the sequence of testing 
temporarily stopped at the level at 
which successful performance was first 
reached, and a learning program de- 
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signed to foster achievement at the 
next higher level (previously failed) 
was administered. This program and 
its results will Shortly be described. 
Following this, testing on the remain- 
ing learning set tasks was undertaken 
in the order given. The possibility of 
effects of the learning program on the 
performance of these lower-level learn- 
ing sets (not specifically practiced in 
the learning) is of course recognized, 
but not further considered in the pres- 
ent discussion. 

A particular time limit was set for 
each test item, at the expiration of 
which the item was scored as failed. If 
a wrong answer was given before this 
time limit, the subject was told it was 
wrong, and encouraged to try again; if 
the correct answer was supplied within 
the time limit, the item was scored as 
passed. It is emphasized that these 
time limits, which were based on pre- 
liminary observations on other subjects 
with these tasks, were not designed to 
put “time pressure” on the subjects, 
nor did they appear to do so, 

The patterns of success achieved on 
the final task and all subordinate learn- 
ing set tasks, by all seven subjects, are 
shown in Table 1. The subjects have 
been arranged in accordance with their 
degree of success with all tasks, begin- 
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ing with one who failed the final task 
but succeeded at all the rest. Several 
things are apparent from these data. 
First of all, it is quite evident that 
there are quite different “patterns of 
capability” with which individuals ap- 
Proach the task set by the study. Some 
are unable to do a task like IIA (see 
Figure 1), others to do a task like IB 
which is of course quite different. Still 
others are unable to do either of these, 
and in fact cannot perform successfully 
a task like IIIB. All seven of these 
subjects were able to perform IV-level 
tasks successfully, although in prelimi- 
nary observations on similar tasks 
we found some ninth-grade boys who 
could not. 

Second, the patterns of pass and fail 
On these tasks have the relationships 
predicted by the previous discussion. 
There are no instances, for example, 
of an individual who is able to perform 
what has been identified as a “higher- 
level” learning Set, and who then shows 
himself to be unable to perform a 
“lower-level” learning set related to it. 

If learning sets are indeed essential 
for positive transfer, the following con- 


Sequences should ensue: 
l. If a hi 


gher-level learning set is 
passed (+) 


» all related lower-level 
tasks must have been passed (+). 


TABLE 1 


PATTERN OF SUCCESS ON LEARN 
SERIES Task 


ING SET Tasks REL 
FOR SEvEN N 


ATED TO 


THE FINAL NUMBER 
INTH GRADE Boys 
Task 

Subject 

Rina : s NB} mp | gy IVà2 | as | iva: | wg; 
WW = + E + $ AX 
we | = +o} e a fk a x + | = + 
PM xi = = + + às 4 i i i 
GR ab | oF t t + + 
DJ 4 2 T t "i t 
JR bal d: T t t t 
RH ies i ri i + t 

Note.—+ = Pass; — = Fail. 
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TABLE 2 


Pass-FAIL RELATIONSHIP BETWEEN RELATED ADJACENT HIGHER- AND LOWER-LEVEL 


LEARNING SETS FOR A GROUP OF SEVE) 


NINTH-GRADE Boys 


Number of cases with relationship 


Test of relationships 


Relationship examined 


Higher + | Higher — | Higher + | Higher — N Proportion 
Lower + | Lower — | Lower — | Lower + | 0424+43) a +2)/(1 +2 +3) 
Final Task: I 0 6 0 1 6 1.00 

I: HA, IIB 1 5 0 1 6 1.00 
IIA: IVA, IVA2, IVAB 2 0 0 5 2 1.00 
IIB: IIB 3 2 0 2 5 1.00 
IIIB: IVAB, IVB1, IVB2 5 0 0 2 5 1.00 


Note.—-- = Pass; — = Fail. 


2. If one or more lower-level tasks 
have been failed (—), the related 
higher-level tasks must be failed (—). 

3. If a higher-level task is passed 
(+), no related lower-level tasks must 
have been failed (—). 

4. If a higher-level task has been 
failed (—), related lower-level tasks 
may have been passed (+). The ab- 
sence of positive transfer in this case 
would be attributable to a deficiency in 
instructions, and does not contradict 
the notion that lower-level sets are es- 
Jen the achievement of higher- 
Gn ur found to exist in 
cmd dien e jects are summarized in 
ag al uos Te ies possible higher- 
FÉ relationship possible of 

ng is listed in the left-hand col 
It will be noted that nie gianni 

l there are several 
relations! 1 
elationships of the type higher (—) 
lower (+), as listed in Golan 5. 
These provided no test of the hypothe- 
sis regarding hierarchical relations 
among learning sets. The instances in 
the remaining columns do, however 
The ++ and —— instances are verify- 
ing, whereas +— instances would be 
nonverifying. As the final column 
indicates, the percentage of verifying 
instances is in all cases 100%. 


Effects of Learning Program Adminis- 
tration 


If the characteristics of instructions 
as previously described are correct, it 
should be possible to construct a learn- 
ing program which can be begun for 
each individual at the point of his low- 
est successful learning set achievement, 
and bring him to successful achieve- 
ment of the final class of tasks. Briefly, 
its method should be to include frames 
which have the functions of (a) insur- 
ing high recallability of relevant learn- 
ing sets on which achievement has been 
demonstrated; (b) making possible 
identifications of expected perform- 
ance and of new stimuli, for each newly 
presented task; and (c) guiding think- 
ing so as to suggest proper directions 
for hypotheses associating subordinate 
learning sets with each new one. 

A program of this sort was admin- 
istered to each of the seven ninth-grade 
boys, beginning at the level at which he 
first attained success on learning set 
tasks (Table 1). This was done by 
means of à simple teaching machine 
consisting of a visible card file clipped 
to a board mounted at a 40° angle to 
the learner's table, and containing 
material typed on 4" X 6" cards. He 
wrote his answer to successive frames 
on a numbered answer sheet, then 
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flipped over the card to see the correct 
answer on the back. He was in- 
structed that if his answer was wrong, 

` he should flip the card back, and read 
the frame again until he could “see” 
what the right answer was. 

After completing the instructional 
portion of the program for each learn- 
ing set, the learner was again presented 
with the identical test-item problem he 
had tried previously and failed, If he 
was now able to do it correctly, he was 
given five additional items of the same 
Sort to perform, and then taken on by 
instructions to another learning set in 
either a coordinate or higher-level posi- 
tion in the hierarchy, This process 
was continued through the perform- 
ance of the final task. 

'The data collected in this way yield 

- pass-fail scores on each test item 
(representing a particular member of a 
class of tasks) before the administra- 
tion of the learning program, and simi- 
lar scores on the same item after learn- 
ing. It is recognized that for certain 
experimental purposes, one would wish 
to have a different, matched, task for 
the test given after learning, to control 
for the effects of "acquaintance" during 
the first test. Since this study had an 
exploratory character, such 4 control 
was not used this time, However, it 
Should be clearly understood that the 
first experience with these test items in 
question, for these subjects, involved 
only activity terminating in failure 
to achieve solution. No information 
about the correct solutions was given, 

A striking number of instances of 
success in achieving correct solutions 
to learning set tasks was found follow- 
ing learning as compared with before, 
These results; are summarized in 
Table 3. Although for learning set 

IIB the percentage of success was 
only 5096 (with two cases), there 
were two learning sets for which 
100% success was achieved, and the 
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TABLE 3 


NUMBER oF INSTANCES Or PASSING AND 
FAILING FINAL TAsK AND SUBORDINATE 
LEARNING SET Tasks BEFORE AND 
AFTER ADMINISTRATION OF AN 
ADAPTIVE LEARNING PRO- 

GRAM, IN A GROUP OF 
SEVEN NINTH- 


Grave Boys 
Number Number B (age 
Ic iling 'ercen 
Task balers passing tear succes 
learning learning 
Final Task 7 6 86 
I 5 4 80 
I 5 5 100 
IIB 2 1 50 
IIIB 2 2 100 
Total 21 18 86 


percentage for 
was 86%, 
tional evid 


all instances combined 
These results provide addi- 
t ence compatible with the 
idea of the knowledge hierarchy. 

The learner in Such a program does 
not "practice the final task"; he ac- 
quires specifically identified capabili- 
ties Jn a specified order, In as many 
as Six out of seven cases, we were able 
by this means to bring learners from 
various levels of competence all the 
way to final task achievement, (It is 
perhaps important that the exception 
was JR, one of two who had most to à 
learn.) Of course, it 
nized th; 
tribute 


IMPLICATIONS ror INDIVIDUAL 
IFFERENCES MEASUREMENT 

It 1s evident that learning sets, as con- 
ceived in this paper, Operate as “indi- 
Vidual differences" variables, which, 
when suitably manipulated, also þe- 
come “experimental” variables, There 


Tue ACQUISITION OF KNOWLEDGE 


are some additional implications which 
need to be pointed out regarding the 
functioning of learning sets in the de- 
termination of measured individual 
differences. 

As the process of identification. of 
subordinate learning sets is progres- 
sively continued, one arrives at some 
learning sets which are very simple and 
general, and likely to be widespread 
within the population of learners for 
which the task is designed. Consider, 
for example, learning set IVB (Figure 
1), which is represented by a task such 
as4 X 2— 5--? If one makes a further 
analysis to identify a subordinate learn- 
ing set for this task, the answer appears 
to be, “adding, subtracting, multplying, 
and dividing one- and two-place num- 
bers.” It is interesting to note that 
this is exactly the task provided by a 
set of factor reference tests (French, 
1954) called Number. In a similar 
manner, the other two circled entities 
in the last row of Figure 1 were 
identified. One is Symbol Recogni- 
Pes aled Associative Memory by 
hs Flan i and Recogni- 
ot Cas atterns | (called Flexibility 
jon Ld The implication is, there- 
fied’ by ae simplest tasks, identi- 
aut s analysis „techniques as 
e e variety of human 
pen » also function as learning 

The 1 
that [itd e = proposed 
transfer to Higher) mediate positive 
often, Sigh eds evel tasks, Very 

ly, the measu 
of transfer of training impli erm 
second task is learned B vd Mes " 
when preceded by the learnin: did 
initial task, than when not so pieced a 
Accordingly, it seems necessary 5k fim 
tinguish between expected correlations 
of these basic factors (at the bottom d 
the hierarchy) with rate of attainment 
of higher-level learning sets on the one 
hand, and correlations of these same 
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factors with achievement of higher- 
level learning sets on the other. 

The implications of this line of rea- 
soning would seem to be somewhat as 
follows: Factors which are found by 
the kind of psychological analysis pre- 
viously described to lie at the bottom of 
the knowledge hierarchy should exhibit 
certain predictable patterns of correla- 
tion with higher-level learning sets. 
They should correlate most highly with 
rate of attainment of the learning sets 
in the next higher level to which they 
are related, and progressively less as 
one progresses upwards in the hier- 
archy. The reason for this is simply 
that the rate of attainment of learning 
sets in a hierarchy comes to depend to 
an increasing extent on the learning 
sets which have just previously been 
acquired and accordingly to a decreas- 
ing extent upon a basic factor or abil- 
ity. Some analogy may be drawn here 
with the findings of Fleishman and 
Hempel (1954) on motor tasks. 

The expected relationships between 
factor test scores and achievement 
scores (passing or failing learning 
sets) throughout such hierarchies seem 
to require a somewhat more complex 
derivation. First of all, such relation- 
ships will depend upon the effective- 
ness of a learning program, or perhaps 
on the effectiveness of previous learn- 
ing. If the learning program is per- 
fectly effective, for example, and if 
differences in rate of attainment are 
everyone will pass all the 
sks, and the variance will 
accordingly be reduced to zero. Under 
these circumstances, then, one may 
expect all correlations with basic fac- 
tors to be zero. However, one must 
consider the case in which the learning 
program is not perfectly effective. In 
such a case, the probability that an 
individual will acquire a new learning 
set, as opposed to not acquiring it, will 
presumably be increased to the extent 


ignored, 
learning set tasks, 
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that he scores high on tests of related 
basic abilities. If one continues to 
collect scores on learning set tasks of 
both successful achievers and those 
who fail, the result will presumably be 

an increasing degree of correlation be- 
tween basic ability scores and learning 
set tasks as one progresses upwards in 
the hierarchy. The reason for this is 
that the size of the correlation comes 
to depend more and more upon vari- 
ance contributed by those individuals 
who are successful, and less and less on 
that contributed by those who effec- 
tively "drop out." 

The difference in expectation be- 
tween the increasing pattern of correla- 
tion with achievement scores, and the 
decreasing pattern with measures of 
rate of attainment, is considered to be 
of rather general importance for the 
area of individual differences measure- 
ment. Confirmatory results have been 
obtained in a recent study (Gagné & 
Paradise, 1961) concerned with the 
class of tasks "solving linear algebraic 
equations." 


Discussion 


The general view of productive 
learning implied in this paper is that 
it is a matter of transfer of trainin 
from component learning sets to a new 
activity which incorporates these pre- 
viously acquired capabilities. This 
new activity so produced is qualita- 
tively different from the tasks which 
correspond to the "old" learning sets; 
that is, it must be described by a dif- 
ferent set of operations, rather than 
simply being "more difficult" The 
characteristics of tasks which make 
achievement of one class of task the 
required precursor of achievement in 
another, and not vice versa, are yet to 
be discovered. Sufficient examples ex- 
ist of this phenomenon to convince one 
of its reality (Gagné, Mayor, Garstens, 
& Paradise, 1962; Gagné & Paradise, 
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1961). What remains to be done, pre- 
sumably, is to begin with extremely 
simple levels of task, such as discrimi- 
nations, and investigate transfer of 
training to tasks of greater and greater 
degrees of complexity, or perhaps ab- 
Stractness, thus determining the dimen- 
Sions which make transfer possible. 
The path to research on the charac- 
teristics of instructions appears more 
straightforward, at least at first glance. 
The establishment of identifications is 
à matter which has been investigated 
extensively with the use of paired as- 
Sociates. The employment of instruc- 
tions for this purpose may need to take 
Into consideration the necessity for 
learning differentiations among the 
stimulus items to be identified, as well 
as other variables suggested by verbal 
learning studies, The function of in- 
ducing high recallability would seem to 
be a matter related. to repetition. of 
learning set tasks, and may in addition 
be related to time variables such as 
those involved in distril 
tice. As for guidance 
distinguishing of this 
others performed by 


should at least make possible the de- 
a of more highly analytical studies 
t 


han have been possible in the past. 
In the meantime, the approach em- 


bution of prac- 
of thinking, the 
function. from 
instructions 


much 
a way of under- 
g of school sub- 
s and science, and 
Naturally, every 
different hier- 
learning sets when this 
method of analysis is applied. Often, 
the relationship of higher to lower 
€arning sets js more complex than 
that exhibited in Figure 1. It should 
be possible, beginning with any exist- 
ing class of tasks, to investigate the 
effects of various instruction variables 
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within the framework of suitably de- 
signed learning programs. 
, The major methodological implica- 
tion of this paper is to the effect that 
investigations of productive learning 
must deal intensively with the kinds of 
variables usually classified as “indi- 
vidual differences.” One cannot de- 
pend upon a measurement of general 
proficiency or aptitude to reveal much 
of the important variability in the capa- 
bilities people bring with them to a 
given task. Consider, for example, the 
seven ninth-grade boys in our study. 
Each of them had “had” algebra, and 
each of them had “had” arithmetic. 
There was no particularly striking rela- 
tionship between their ultimate per- 
formance and their previous grades in 
algebra (although there is no doubt 
some correlation), nor between this 
performance and “general intelligence.” 
But the measurement of their learning 
sets, as illustrated in Table 1, revealed 
à great deal about how they would be- 
have when confronted with the learn- 
Ing program and the final task, For 
rig ache ie had to begin, in ef- 
EEM pied m hari for others. 
while others could m n -: do 4 
equally well provid A pa could do it 
other things first Th xis T 
. The methodological 


point is sim is: i 

I : ply this: if one 
Investigate the effect gcc 
mental treatment on 
individuals or groups 


s of an experi- 
the behavior of 


) who sta 
the same point, he would be rit 


vised to measure and map out f 

individual the learning sets releva 2 
the experimental task. In this e$: » 
can have some assurance of the a eni 
to which his subjects are equivalent 5 
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ON THE RELATION BETWEEN LOGIC AND THINKING 


MARY HENLE? 
Graduate Faculty of Political and Social Science, 


New School for Social Research, New York 


The question of whether logic is de- 
scriptive of the thinking process, or 
whether its relation to thinking is nor- 
mative only, seems to be easily an- 
swered. Our reasoning does not, for 
example, ordinarily follow the syllo- 
gistic form; and we do fall into contra- 
dictions. On the other hand, logic 
unquestionably provides criteria by 
which the validity of reasoning may be 
evaluated. Logical forms thus do not 
describe actual thinking, but are con- 
cerned with the ideal, with “how we 
ought to think.” And yet a problem 
seems to be concealed beneath this easy 
solution. 

It is interesting to note that a num- 
ber of the older writers on logic 
regarded their discipline as the science 
of thought. It will not be possible 
here to survey the vast literature, some 
of which bears only by implication on 
our problem, but some examples will 
illustrate this position. Cohen and 
Nagel (1934) summarize the older 
view: “An old tradition defines logic 
as the science of the laws of thought” 
(p. 18). Kant (1885) holds that 
“logic is a science of the necessary laws 
of thought, without which no employ- 
ment of the understanding and the rea- 
son takes place" (p. 3). Psychology, 
on the other hand, supplies only the 
contingent, not the necessary, rules of 
thought. John Stuart Mill (1874) 
likewise views logic as comprising “the 
science of reasoning, as well as an art, 


founded on that science" (p. 18). He 
continues : 


*This paper was written when the author 
was a Fellow of the John Simon Guggen- 
heim Memorial Foundation, 


Whatever has at any time been concluded 
justly, whatever knowledge has been acquired 
otherwise than by immediate intuition, de- 
pended on the observance of the laws which 
it is the province of logic to investigate. 
If the conclusions are just, and the knowl- 
edge real, those laws, whether known or not, 
have been observed (p. 22). 


Boole (1854), as a final instance, re- 
gards "the laws of the symbols of 
Logic" as “deducible from a considera- 
tion of the Operations of the mind in 
Teasoning” (pp. 45-46). Boole de- 
duces the law of contradiction, for ex- 
ample, from a fundamental law of 
thought (pp. 49-51). 

ore recent writers, on the other 
hand, have tended to reject the view 
that the laws of logic are those of the 
human understanding, Again, only a 


few illustrations will be given. Cohen 
(1944) remarks: 


That the laws 


of logic are not the uni- 
versal law 


+. AWS according to which we do actually 
think is conclusively Shown, not only by the 
most elementary Observation or introspection, 
NU the very existence of fallacies (pp. 


A similar 
(1956) : 


Little need 
that logical 
necessities 


point is made by Nagel 


be said in refutation of the view 
Principles formulate the “inherent 
sities of thought” and are generalized 
Sun DHOIS of the operations of minds. 
IEEE the actual occurrence of beliefs in 
Ogically incompatible propositions makes 
nonsense of the claim that the principle of 
noncontradiction expresses a universal fact 
of PSychology (p, 66). 

Regarding the Work of Boole referred 
above (An Investigation of the Laws 
of Thought), Cohen and Nagel (1934) 
comment: “The title is a misnomer” 
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(p. 112). A more extreme statement 
of this point of view is by Bertrand 


Russell (1904) : 


Throughout logic and mathematics, the 
existence of the human or any other mind 
is totally irrelevant; mental processes are 
studied by means of logic, but the subject- 
matter of logic does not presuppose mental 
processes, and would be equally true if there 
were no mental processes (p. 812). 


Schiller (1930), too, holds that syllo- 
gistic reasoning “has nothing whatever 


to do with actual reasoning, and can 
make nothing of it" (p. 282). He de- 


. Scribes the “Laws of Thought" (laws 


Sage contradiction, and excluded 

niddle) as “v ions" 
251. erbal conventions" (p. 
The changed point of view with re- 
gard to the relation between logical 
principles and the laws of thought 
seems to be a function of an altered 
intellectual climate rather than of any 
fundamental discoveries about the na- 
ture of reasoning. It thus seems worth- 
while to reopen the question, the more 
mt it has implications for a num- 
Sii it se eee 
Mii . ; however, it may be 
erest to examine a few more forms 


in which the 
» ^ 
iue present question has been 


J Discussion of the 
gism has at times 
relevance to actual 


figures of the syllo- 
centered on their 
al thinking, J, N. 
cites several writers 


they hold that we urth figure because 


ae d 
son in it (pp. 230-2 3l S actually rea- 


indeed, finds all but the fi 
CEN and false; the fou 
ar he calls unnatural 
Keynes, on the other mrt ur 
the admission of Figure 4 on thé sa n 
grounds, namely its relevance to ach] 
thinking: “It is not actually in frequent 
use, but reasonings may sometimes not 
unnaturally fall into it" (p. 232). 
Psychologists investigating reasoning 


tst figure both 
rth in particu- 


Kant (1885),: 


processes have tended to underempha- 
size the role of logic in the thinking of 
their subjects. To illustrate, Bruner, 
Goodnow, and Austin (1956) suggest 
that 


much of human reasoning is supported by 
a kind of thematic process rather than by an 
abstract logic. The principal feature of this 
thematic process is its pragmatic rather than 
its logical structure (p. 104). 


Individuals tend to prefer “empirically 
reasonable propositions” to logical ones 
(p. 104). Morgan and Morton (1944) 
conclude that : 


A person is likely to accept a conclusion 
which expresses his convictions with little 
regard for the correctness or incorrectness 
of the inferences involved. Our evidence will 
indicate that the only circumstance under 
which we can be relatively sure that the in- 
ferences of a person will be logical is when 
they lead to a conclusion which he has 
already accepted (p. 39). 


Lefford (1946) states that the princi- 
ples of logical inference “are techniques 
which are not the common property of 
the unsophisticated subject” (p. 144). 
He goes so far as to distinguish from 
the logical inference 


psychological inferences which may be made 
by the ordinary person... . A psychological 
inference is not valid or invalid except when 
judged as a logical inference: psychological 
inference is purely a fact (p. 145). 


Common to all these statements by 
the psychologists is the assumption that 
logical principles are irrelevant, if not 
antithetical, to much actual reasoning.” 
This conclusion is derived from the 


2A different, but related, position is that 
represented by Dollard and Miller (1950), 
who hold that being logical is a learned drive. 
The child, it is argued, is punished for logi- 
cal contradictions and absurdities, for il- 
logical and contradictory plans. The result 
for most people is “a learned drive to make 
their explanations and plans seem logical” 
(p.120). The implication is, of course, that 
without this specific training the individual's 
thinking would not be (or seem) logical. 
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high incidence of wrong inferences of 
subjects under test conditions, espe- 
cially in the case of emotionally relevant 
material. A different conclusion has 
been drawn by Von Domarus (1944) 
from observation of errors in the rea- 
soning of schizophrenic patients. This 
author argues, not that the reasoning of 
his subjects is unrelated to logic, but 
rather that it conforms to a logic whose 
laws are different from those of Aristo- 
telian logic. This is “paralogic,” which 
excludes the law of contradiction and 
“accepts identity based upon identical 
predicates” (p. 111). This idea has 
been elaborated by Arieti (1955) who 
sees the operation of this logic (which 
he calls paleologic) not only in schizo- 
phrenic and primitive thinking, as Von 
Domarus does, but also in dreams, in 
some infantile thinking, and in the 
transference situation in psychoanaly- 
sis. 

Again it is of interest to find that 
the earlier writers mentioned above 
were equally aware of the problem of 
error, but viewed it in a way that was 
entirely compatible with their concep- 
tion of logic as the science of the laws 
of the mind. “It is easy to see how 
truth is possible,” writes Kant (1885, 
p. 44) “since in it the understanding 
acts according to its own essential 
laws.” Error, however, is difficult to 
understand since it constitutes “a form 
of thought inconsistent with the under- 
standing.” Its source is thus not to be 
sought in the understanding itself, but 
rather in the “unobserved influence of 
the sensibility on the understanding,” 
the sensibility being that faculty which 
“supplies the material for thought. 
Boole (1854) likewise considers that 
“the phaenomena of incorrect reason- 
ing or error... are due to the inter- 
ference of other laws with those laws 
of which right reasoning is the prod- 
uct" (p. 409). He reminds us that 
*the laws of correct inference may be 
violated, but they do not the less truly 
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exist on this account" (p. 408). Mill 
(1874) is still more explicit, Discuss- 
ing fallacies of ratiocination, he points 
out that since 


the premises are seldom formally set out,..- 
it is almost always to a certain degree 
optional in what manner the suppressed link 
shall be filled up. . . . [A person] has it 
almost always in his power to make his 
Syllogism good by introducing a false prem- 
ise; and hence it is scarcely ever possible 
decidedly to affirm that any argument in- 
volves a bad syllogism (p. 560). 


In the case of arguments consisting 
not of a single syllogism but of a chain 
of syllogisms, he considers the common- 
est fallacy of ratiocination to lie in a 
changing of the premises as the argu- 
ment proceeds. 

Two clearly contrasting alternatives 
thus present themselves: Is logic (or 
Aristotelian logic) largely irrelevant to 
the thinking process, or is it concerned 
with the laws of thinking? Since we 
will here be concerned only with deduc- 
tive reasoning, we may reformulate the 
question more specifically in these 
terms. But since, as has so often been 
pointed out, the premises from which 
we reason are commonly not spelled 
out, since our inferences so frequently 
appear as enthymemes,* this fact must 
be taken into consideration, We may 
ask: If we know the premises—tacit 
as well as explicit—from which a per- 
Son reasons, can we put the process in 
syllogistic form? Do the rules of the 
syllogism describe processes that the 
mind follows in deductive reasoning, 
even when the syllogistic form is not 
explicitly employed? 

It has been shown above that the 
existence of error has been used as evi- 
dence for the irrelevance of logic to the 
actual thinking process. On the other 
hand, a different interpretation of error 

3"A syllogism that is incompletely stated, 
in which one of the premises or the conclusion 
is tacitly present but not expressed, is calle 


an enthymeme" (Cohen & Nagel, 1934, P- 
78). 
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has been suggested. Since the problem 
of error seems to bea particularly fruit- 
ful one in which to join the issues 
before us, it will be taken as the con- 
text for the present discussion. Once 
more we may reformulate our question 
as follows: Do errors in deductive rca- 
soning mean that the logical process has 
been violated? As Mill expresses it, 
does the occurrence of error mean that 
the syllogism is a bad one? Or can 
the error be accounted for otherwise? 
Is it possible that a process that would 
follow the rules of logic if it were 
spelled out is discernible even when the 
reasoning results in error? 

The distinction being made here is 
a familiar one in the psychology of 
learning and thinking. Thus Koffka 
(1935, Ch. 12) distinguishes between 
learning as accomplishment and the 
learning processes responsible for this 
accomplishment. Kóhler's (1927) dis- 
tinction between "good errors" and 
stupid ones is likewise relevant. Good 
errors, he points out, "may, in a cer- 
tain sense, be absolutely appropriate to 
the situation" (p. 217), although they 
Solve the problem no more than do 
stupid ones. Wertheimer (1959), too, 
distinguishes between solutions ob- 
tained by “blind” procedures and “fine, 
Senuine solutions.” Again the differ- 
ence is one of process, since in both 
cases the result may be the same. In 
the same way, in connection with the 
present Problem, we may ask: Given 
contrasting results—correct Solutions 
and errors in deductive reasoning— 
what can we say about the thinking 
processes that account for them? Are 
the processes necessarily different be- 
cause their effects are different? 

Illustrative data that bear on this 
issue will be presented. They were ob. 
tained from 46 graduate students of 
Psychology who were asked to evaluate 
the logical adequacy of deductions pre- 
sented in the context of everyday prob- 
lems. Most of the subjects had no 
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training in formal logic.t The material 
was presented under group conditions, 
the subjects writing out their judg- 
ments and their grounds for making 
them. Instructions included an explicit 
statement that the logical adequacy of 
the arguments was to be judged, not 
the truth of the statements. 

Individual interviews were also con- 
ducted with a different set of subjects. 
Although the results agree generally 
with those obtained under group condi- 
tions, they will not be presented because 
of a problem already mentioned. Mill, 
it has been seen, pointed to the difficulty 
of recognizing a bad syllogism because 
a person can easily introduce into his 
incompletely stated argument a new 
premise that will make the inference 
valid. It was often difficult to decide 
whether the material elicited by direct 
questioning consisted of new premises 
not entering into the original inference, 
or whether the subject was, as intended, 
making clear his original understanding 
of the premises. Although the group 
data are less rich than the individual 
interviews, and although the products 
of written communication are likely to 
be less spontaneous reflections of the 
thinking process than interviews, they 
exclude this possibility of overlooking 
true fallacies ; indeed, if anything, they 
Overestimate the incidence of bad syllo- 
gisms, since the reasoning is often in- 
completely described in the case of 
apparent errors.5 

The present data will be used only to 
ilustrate the reasoning processes in 
cases of error; no quantitative results 


* It must be recognized, of course, that the 
lack of formal training in logic is insufficient 
guarantee of the naiveté of subjects. It can 
only be pleaded that we have no better cri- 
terion at the present time. Whatever the in- 
formal selí-education of subjects in this re- 
Spect, the results to be presented cannot be 
regarded as the product of formal instruction 
in logic. 

^A sample interview may be found in 
Henle (1958). 
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will be presented. As many authors 
have shown, the incidence of error in 
deductive reasoning depends on the 
form of the syllogism and its contents, 
as well as on instructions to the sub- 
jects. Quantitative results would have 
relevance only to the particular condi- 
tions studied here, whereas an inquiry 
into the nature of the errors obtained 
might be of more general interest. 

Several processes may be distin- 
guished which led to error in dealing 
with the presented material: 


FAILURE TO ACCEPT THE LOGICAL 
TASK 


More specifically, this means failure 
to distinguish between a conclusion that 
is logically valid and one that is fac- 
tually correct or one with which the 
subject agrees. This source of error 
has already been reported by Henle 
and Michael (1956, p. 124). 

A sample syllogism follows, along 
with responses in which errors occur 
because of failure to grasp or accept the 
_ logical task. 

Syllogism 6. A group of women 
were discussing their household prob- 
lems. Mrs. Shivers broke the ice by 
saying: “I’m so glad we're talking 
about these problems. It’s so impor- 
tant to talk about things that are in 
our minds. We spend so much of our 
time in the kitchen that of course house- 
hold problems are in our minds. So it 
is important to talk about them.” (Does 
it follow that it is important to talk 
about them? Give your reasoning.) 

Responses: “The conclusion does not 
follow. The women must talk about 
household problems because it is im- 

portant to talk about their problems, 

not because the problem is in their 
um 

cos It is not important to talk 

about things that are in our minds un- 

Jess they worry us, which is not the 


case.” 
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“No. Just because one spends ‘so 
much time’ in the kitchen it does not 
necessarily follow that household prob- 
lems are ‘in our minds’.” 

It should be noted that subjects 
who failed to accept the logical task 
frequently gave correct responses that 
are just as irrelevant to the question of 
the relation of logic to the thinking 
process as are the errors just cited. A 
few examples follow: 

“Yes. It could be very important to 
the individual doing the talking and 
possibly to some of those listening, be- 
cause it is important for people to ‘get 
a load off their chest,’ but not for any 
other reason, unless in the process one 
or the other learns something new and 
of value.” 

"Yes. It seems obvious that prob- 
lems which are in the forefront of one's 
mind bring more consideration to them 
and possibly newer aspects when they 


are discussed with another. Two heads 


may be better than one." 

“Yes it does. By talking household 
problems, a problem can be solved or 
worked through." 

The errors illustrated here clearly do 
not demonstrate an inability of the sub- 
jects to reason logically, since they have 
not accepted the logical task. They 
have evaluated tlie content of the con- 
clusion, not the logical form of the 
argument. Richter (1957, p. 341) 
makes the same distinction, carrying 
the analysis a step farther. Apart from 
careless mistakes and those arising 
from “imperfections in the classifying 
operation”—i.e., from an inability to 
make logical deductions, he describes 
errors arising from “a general failure 
to grasp the concept of ‘logical va- 
lidity’” and those arising from “the 
specific inability to differentiate ‘logical 
validity’ from another attribute of syl- 
logisms," namely their factual status. 
To apply to the present data, the factual 
criterion needs to’ be interpreted to 
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mean what is believed to be true or 
reasonable as well as what is known 
to be true. With Richter’s distinction 
in mind, we may note that no subject 
failed completely to grasp the logical 
task. Still, one individual judged on 
the basis of belief rather than logical 
validity in the case of 9 out of 10 syl- 
logisms, and several others judged in 
this manner with at least half of the 
syllogisms presented. 


RESTATEMENT OF A PREMISE OR 
CONCLUSION sO THAT THE 
INTENDED MEANING Is 
CHANGED 


In a number of cases examination 
of the material from which a subject 
reasoned showed that it differed from 
that which was originally presented. 
In such cases the validity of the argu- 
ment can, of course, be judged only in 
relation to the syllogism actually em- 
ployed, not the one intended. 

1. In the case of syllogisms in which 

a conclusion from two particular prem- 
ises was to be evaluated, the premises 
were occasionally restated as univer- 
sals, from which the conclusion fol- 
lowed. The following syllogism per- 
mitted this kind of change: 
A Syllogism $. Mrs. Cooke had stud- 
ied home economics in college. “Youth 
Is a time of rapid growth and great 
demands on energy,” she said. “Many 
youngsters don’t get enough vitamins 
in their daily diet. And since some 
vitamin deficiencies are dangerous to 
health, it follows that the health of 
many of our youngsters is being en- 
dangered by inadequate diet." (Does it 
follow that the health of many young- 
sters is being endangered by inadequate 
diet? Give your reasoning.) 

Responses: “Yes. Youngsters (A) 
don’t get vitamins (B). Not getting 
vitamins (B) dangerous to health (C). 
Youngsters (A) in danger of poor 
health (C).” 
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“This follows. . . . Youth doesn’t get 
enough vitamins. Vitamins are neces- 
sary or bad health results. Therefore 
youth are endangered. All of these 
follow logically, and I believe a correct 
inference has been made.” 

2. Other changes were introduced 
into the premises by subjects whose 
reasoning was found to be correct even 
though their answers to the intended 
problem were wrong. The same syllo- 
gism (Syllogism 8) will be used to 
illustrate. 

Responses: “There is some question 
about the equivalence of 'don't get 
enough vitamins' and a vitamin defi- 
ciency dangerous to health. If one 
assumes their equivalence, then the con- 
clusion just about manages to follow." 

"Correct if we assume that the 
youngsters are lacking those vitamins 
in their diet which endanger health." 

“Tt seems to follow, assuming that the 
deficient vitamins are also the vital 
ones." 

Everyday discussions permit greater 
scope for such changes in meaning than 
does the material employed here. Pri- 
vate meanings of terms or idiosyn- 
cratic or unconscious equations of con- 
cepts may enter in to prevent two 
people from drawing the same conclu- 
Sion even though they start ostensibly 
from the same material. 


OMISSION or A PREMISE 


Occasionally a subject employed only 
one of the presented premises. The 
informal manner in which the premises 
were set out made possible this treat- 
ment of the material. Syllogism 6 (cf. 
above) will be used to illustrate. 

Responses: “Not correct logically. 
(a) A group of women spend much 
of their time in the kitchen; (b) thus 
household problems occupy their minds. 
- .. (c) Therefore it is important to 
talk about them. Mrs. Shivers’ sub- 
jective thinking." 
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"Doesn't follow from the preceding 
statement. Because she spends a lot 
of time in the kitchen and household 
problems are on her mind, it doesn't 
follow that it is important to talk about 
them." 

In a few cases, indeed, both of the 
intended premises were omitted by the 
subject. For example (Syllogism 6): 

“No it doesn't follow. The fact that 
she spends much time in the kitchen 
has nothing to do with whether or not 
it is important to talk about the prob- 
lems." 

It will be seen that in all these cases 
in which a premise has been omitted, 
the subject correctly reports that the 
conclusion does not follow. If we dis- 
regard the intended premises and con- 
sider only the material actually used 
in reasoning, the subjects’ deductions 
are seen to be correct. 


SLIPPING IN OF ADDITIONAL 
PREMISES 


Under the present conditions this de- 
vice was infrequent, but it seems to 
be more common where premises are 
stated less completely, and where the 
issues are of more immediate practical 
concern, as in everyday discussion. 
For example, premises may be added 
that are so commonplace, so much taken 
for granted, that they escape attention. 

Instances of the adding of assump- 
tions to our material follow. Since they 
are readily understandable, and for the 
sake of brevity, the responses will be 
quoted without the presented syllogism. 

Responses: “Without intending 
sophistry it must be pointed out that 
the implicit premise is ‘crime is bad. 
If this is granted, the conclusion follows 
that if comics contribute to influence 
the youth along criminal paths, they 
should be eliminated.” "m 

* . . If comic books are an evil in- 
füeiice; then they should be got rid 
of.” Here the subject is clearly em- 
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ploying an implicit premise that he 
added to the material presented, namely 
“Whatever is an evil influence should 
be got rid of.” 

Again in these cases, wrong answers 
are obtained by correct reasoning if we 
consider the syllogism as the subject 
understood it rather than the one the 
investigator hoped to present. 

When all the above processes have 
been taken into account, a considerable 
number of errors remain. Do these 
represent true fallacies? It is difficult 
to say since in these cases either no 
account of the reasoning process is sup- 
plied, or an unclear explanation 1S 
given, or else the subject merely sets 
out the premises and conclusion given 
to him without comment. It might be 
that many of these errors would have 
been found on examination to involve 
processes similar to the ones described 
above, so that no error in reasoning is 
involved. On the other hand, the pos- 
sibility of true fallacies cannot be ex- 
cluded. 

In connection with the unexplained 
errors, it is worth noting that almost 
as many correct responses as errors 
were unexplained or unclearly ex- 
plained. The absence of explanation 15 
thus no reason to suspect a reasoning 
process of being fallacious. 

It must also be mentioned that the 
processes described above as producing 
apparent errors although the deductive 
Process is correct are by no means 
viewed as constituting a complete list- 
They were observed with particular 
materials and under particular condi- 
tions; and it is to be anticipated that 
further research will expand this list- 
Chapman and Chapman (1959), for 
example, have suggested several addi- 
tional processes by which error is pro- 
duced in syllogistic reasoning, one ° 
which involves a misunderstanding ° 
the task different from that describe¢ 
here. 
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Despite the limitations of the present 
data, they do show clearly that when 
subjects arrive at apparently invalid 
conclusions, or when they fail to spot 
a fallacy, they often do so because they 
have worked with materials different 
from those intended or because they 
have undertaken a task different from 
the one intended. In such cases, if 
we consider the materials and task as 
they were actually understood by indi- 
vidual subjects, we fail to find evidence 
of faulty reasoning. It must be con- 
cluded that the presence of error does 
not constitute evidence that the laws 
of logic are irrelevant to actual think- 
ing. The data tend, rather, to support 
the older conception that these laws are 
widely discernible in the thinking 
process. 


Discussion 


An examination of errors in syllo- 
gistic reasoning leads to two conclu- 
sions: (a) While the possibility of fal- 
lacy often cannot be excluded it is, in 
Mill’s words, “scarcely ever possible 
decidedly to affirm that any argument 
involves a bad syllogism.” (b) Where 
error occurs, it need not involve faulty 
reasoning, but may be a function of 
the individual’s understanding of the 
task or the materials presented to him. 
Spinoza (1930) has stated our present 
conclusion in more general terms: 


When men make errors in calculation, the 
numbers which are in their minds are not 
those which are upon the paper. As far as 
their mind is concerned there is no error, 
although it seems as if there were, because 
we think that the numbers in their minds are 
those which are upon the paper. If we did 
not think so, we should not believe them to 
be in error. For example, when I lately 
heard a man complaining that his court had 
flown into one of his neighbour's fowls, I 
understood what he meant, and therefore 
did not imagine him to be in error. This is 
the source from which so many controversies 
arise—that men either do not properly ex- 
plain their own thoughts, or do not properly 


interpret those of other people; for, in truth, 
when they most contradict one another, they 
either think the same things or something 
different, so that those things which they 
suppose to be errors and absurdities n 
another person are not so (from the Ethics, 
Scholium Proposition 47, Part 2). 


Such a conclusion, which the present 
analysis supports, is of some import- 
ance if, as Mill (1874) writes, “to 
draw inferences has been said to be 
the great business of life" (p. 21). 
Our data suggest that the intelligent 
adult is not so inept at carrying on this 
business as several recent accounts 
have made him out to be. 

In considering the place of logical 
inference in everyday life, it is neces- 
sary also to take into account what 
Aristotle (1945) has termed the “prac- 
tical syllogism," the case in which 


the conclusion drawn from the two premisses 
becomes the action. For example, when you 
conceive that every man ought to walk and 
you yourself are a man, you immediately 
walk; or if you conceive that on a particular 
Occasion no man ought to walk, and you 
yourself are a man, you immediately remain 
at rest. In both instances action follows 
unless there is some hindrance or compulsion. 
-.. Again, I need a covering, and a cloak 
is a covering, I need a cloak, What I need 
I ought to make; I need a cloak, I ought to 
make a cloak. And the conclusion “I ought 
to make a cloak" is an action. The action 
results from the beginning of the train of 
thought. If there is to be a cloak, such and 
such a thing is necessary, if this thing then 
something else; and one immediately acts 
accordingly. That the action is the con- 
clusion is quite clear... . (701a). 


Aristotle points out that this whole 
process need not be explicit. 


The mind does not stop and consider at all 
one of the two premisses, namely, the obvious 
one; for example, if walking is good for a 
man, one does not waste time over the 
premiss "I am myself a man.” Hence such 
things as we do without calculation, we do 
quickly (701a). 


Again, "My appetite says, I must 
drink; this is drink, says sensation or 
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imagination or thought, and one im- 
mediately drinks.” 

If we include, as it seems we must, 
the practical syllogism in the “great 
business of life,” that of drawing in- 
ferences, it becomes difficult indeed to 
accept the conclusion that human 
beings are unable to reason logically. 
A couple of contemporary examples 
will suggest how ubiquitous is the 
practical syllogism in everyday life. 
"Tt is too far to walk to the Public 
Library ; I must take a subway or bus. 
The Fifth Avenue bus passes the 
Public Library. I must take the Fifth 
Avenue bus.” “I do not want to wear 
the same dress two days in succession. 
I wore this dress yesterday; so I do 
not want to wear this dress today.” It 
is difficult to see how the individual 
could cope with the ordinary tasks of 
life if the practical syllogism embodied 
techniques which are not, as one author 
quoted above put it, “the common 
property of the unsophisticated sub- 
ject,” if, indeed, it were not a natural 
mode of functioning of the conscious 
mind. 

Furthermore, if people were unable 
to reason logically, so that cach arrived 
at different conclusions from the same 
premises, it is difficult to see how they 
could understand each other, follow 
one another's thinking, reach common 
decisions, and work together. 

It might be argued that in the case 
of the practical syllogism we are deal- 
ing not with an implicit logical process, 
but with a learning process in which 
each response is cued off by the pre- 
ceding one. There seem to be at least 
two grounds for rejecting such an in- 
terpretation. (a) There are cases m 
which the practical syllogism leads to 
a solution which is genuinely novel for 
the individual. This is surely so for 
some of the examples cited above, at 
least the first time they occur. Indeed, 
if the responses were habitual ones, the 


reasoning process would not be neces- 
sary. (b) Our data obtained from 
conventional syllogisms suggest that a 
logical process often occurs even in 
cases in which the reasoning scems 
fallacious. Such data can thus not be 
interpreted in terms of the “learned 
drive” postulated by Dollard and 
Miller (1950, p. 120) “to make . . . 
explanations and plans seem logical." 
And since they sometimes lead to 
novel conclusions (cf. Wertheimer, 
1938), they cannot be interpreted as 
habitual responses. In the interests of 
parsimony, therefore, a common ex- 
planation for the practical syllogism 
and the verbal one seems preferable. 
An interpretation in terms of an im- 
plicit logical process would fit both 
kinds of case. 

Two further implications of the 
present data may be mentioned. They 
suggest an approach to the problem of 
the influence of needs and attitudes 
on the reasoning process. For the 
problem remains: why do motivational 
influences so often appear to impair 
thinking? It is a plausible hypothesis 
that these influences do not distort the 
reasoning process, as has frequently 
been stated or implied—indeed that 
they do not act at all on the reasoning 
process—but rather that they affect the 
materials with which thinking works. 
Although this is not the place for a 
detailed analysis, one or two specific 
effects of this kind may be suggested. 
(a) It may be that a strong attitude 
toward, or emotional involvement with, 
particular material is in part respon- 
sible for the difficulty which many un- 
sophisticated subjects experience 1n 
distinguishing between drawing a con- 
clusion that is logically valid and one 
that is believed to be correct. Over 4 
wide range of practical situations these 
two tasks are not in conflict; and the 
distinction is thus not an easy one. 
There is evidence that attitudes and 
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emotions may limit the ability to make 
distinctions (cf. Henle, 1955). If this 
suggestion is correct, the more per- 
sonally relevant the material employed, 
the more difficult it will be to accept 
the logical task. Such an effect might 
go z long way toward explaining the 
report of a number of writers that in- 
dividuals are unable to reason logically 
about emotionally toned material. (b) 
An attitude can select from among the 
possibilities that the material presents, 
singling out, for example, one among 
Several possible meanings. It may be 
that such a process is responsible for 
the changes in the meaning of premises 
observed in our data. (For possibili- 
ties of further analysis of this kind, cf. 
Henle, 1955.) 

That motivational processes may act 
on reasoning in the manner suggested 
here receives support from a recent 
study by Kopp (1960), who subjected 
to a similar analysis reasoning proc- 
esses related to the delusions of 
paranoid schizophrenics. His subjects, 
like the present ones, made errors 
which were found to lie more in the 
premises from which the reasoning 
proceeded than in the actual drawing 
of inferences. 

In this connection, it would be in- 


teresting to analyze Arieti's (1955) ex- 


amples of “paleological” thinking in the 
terms of the present study. Unfortu- 
nately, the published data are too scant 
to permit such an analysis. The ex- 
amples presented would be fallacies in 
terms of Aristotelian logic if one could 
assume that the premises as understood 
by the subject are the same as those 
understood by the examiner. "That this 
is the case may be doubted in a number 
of instances; and the search for an 
archaic logic thus seems premature. 
Some examples will be given which 
seem to call for an analysis different 
from Arieti's. 

Arieti cites the case of a woman who 
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“needed to identify herself with the 
Virgin Mary because of the extreme 
closeness and spiritual kinship she felt 
for the Virgin Mary” (1955, p. 195). 
The reasoning process is described as 
follows: “The Virgin Mary was a 
virgin; I am a virgin; therefore, I am 
the Virgin Mary.” In the absence of 
additional material, we may only 
speculate that the proposition "I am 
the Virgin Mary" was not a conclusion 
deduced from the stated premises, but 
that it belongs rather in the category 
of revealed truth or intuition, not in 
that of deductive inference. It seems 
likely to the present writer that the 
syllogism quoted above was in the 
nature of a justification of this intui- 
tion to the examiner rather than the 
grounds for belief in it. The argument 
itself could be evaluated only in the 
light of additional information. 
Another kind of example of Arieti's 
(1955) illustrates what he considers to 
be the propensity of young children 
"to indulge in paleologic thinking" : 


A girl, three years and nine months old, 
saw two nuns walking together, and told her 
mother, "Mommy, look at the twins!" She 
thought that the nuns were twins because 
they were dressed alike. The characteristic 
of being dressed alike, which twins often 
have, led to the identification with the nuns 
(p. 199). 


Again a different interpretation sug- 
gests itself: rather than viewing it as 
a deduction, we may take this process 
more simply as an instance of an in- 
accurate concept. Arieti also discusses - 
dream symbols as examples of paleo- 
logic thinking. It is particularly dif- 
ficult for the present writer to view 
such symbols as inferences, as if they 
were the conclusions of processes of 
deductive reasoning. They seem, 
rather, to involve a much more im- 
mediate grasping of similarities. 

It is amusing to note that Arieti 
(1955) regards some delusions of 
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identification as instances of paleologic, 
and describes the "formal mechanism" 
by which they are produced in a way 
that could easily be restated as a valid 
Aristotelian syllogism : 


If A may be identified with B because 
they have a common quality, it will be suf- 
ficient for me to acquire a quality of the 
person I want to be identified with, in order 
to become that person (p. 207). 


Further discussion of the problems 
raised by Arieti must await more 
complete data. 

The present study has implications 
also for another problem. To label an 
inference fallacious, as is clear from the 
analysis of this paper, is to make a 
statement about the results of reason- 
ing, but not about the process that is 
responsible for these results. It might 
be a matter of considerable interest to 
study systematically and in detail the 
reasoning involved in various fallacies. 
It might be found that many fallacies 
are produced, not by faulty reasoning, 
but by specific changes in the material 
as the subject understands it. Some 
of the syllogisms employed here in- 
volved the fallacy of the undistributed 
middle; here changes of premises from 
particulars to universals and other 
changes in the meaning of premises 
were not uncommon. Two of the 
arguments presented contained illicit 
conversions, but no relevant data were 
obtained since the subjects systemati- 
cally ignored the conversion and cen- 
tered their attention on other parts of 
the material? In connection with il- 
licit conversions, the discussion of 
Chapman and Chapman (1959) is of 
interest; these authors suggest implicit 


6 This unexpected finding cannot be attrib- 
uted to an insensitivity of the subjects to 
logical processes. Rather the fault lies with 
the investigator who, in the effort to dis- 
formal structure of the syllogism, 
irrelevancies that were taken by 
e material for evaluation. 
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assumptions and changes in meaning 
that may be involved in the acceptance 
of such conversions. Mill's (1874) 
observation that fallacies frequently 
involve changing the premises has been 
cited above. He analyzes several ex- 
amples of the fallacy a dicto secundum 
quid ad dictum simpliciter? in these 
terms (p. 561-562). Fallacies of 
ambiguity seem likewise to lend them- 
selves to the kind of analysis suggested 
here.® 


In conclusion, observations have 
been presented which are consistent 


*"'This is committed when, in the prem- 
ises, a proposition is asserted with a quali- 
fication, and the qualification lost sight of 
in the conclusion; or oftener, when a limita- 
tion or condition, though not asserted, is 
necessary to the truth of the proposition, 
but is forgotten when that proposition comes 
to be employed as a premise" (Mill, 1874, 
p. 561). 

SIt is interesting to note that Mill's 
(1874) view comes close to the hypothesis 
offered here. It is true that he describes 
fallacies of ratiocination as: 


those which have their seat in the rati- 
ocmative or deductive part of the investi- 
gation of truth (p. 559). 


However he adds that it is by the rules of 
the syllogism 


that a reasoner is compelled distinctly to 
make his election what premises he is 
prepared to maintain. The election made, 
there is generally so little difficulty in 
seeing whether the conclusion follows 
Írom the premises set out, that we might 
without much logical impropriety have 
merged this fourth class of fallacies in the 
fifth, or Fallacies of Confusion (p. 560 


The latter class includes: 


all those fallacies in which the source of 
error is not so much a false estimate ? 
the probative force of known evidence, 25 
an indistinct, indefinite, and fluctuating 
conception of what the evidence is 
563). 


(This class includes errors involving ambi- 
guity of terms, begging the question, an 
fallacies of irrelevance.) 
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with the view that the rules of the 
syllogism describe processes that the 
mind follows in deductive reasoning, 
even when the syllogistic form is not 
explicitly employed.” A word of cau- 
tion is necessary at this point. The 
present observations apply to deduc- 
tive reasoning only. Even within this 
category, a very limited number of 
logical forms has been studied ; and the 
study has been concerned only with the 
evaluation of presented arguments, not 
with the construction of arguments of 
one's own. The investigation needs to 
be extended before the generality of 
the conclusion can be assessed. It 
must also be pointed out that there arc 
many types of thinking to which logical 
analysis does not apply at all, for ex- 
ample many aspects of free association, 
fantasy, many creative processes. 'To 
apply logical analysis to such processes 
(for example, Arieti’s discussion of 
dream symbols) is as much an error 
as to ignore the underlying logical 


Structure of incompletely stated argu- 
ments. 


SUMMARY 


Two views of the relation of logic 
to thinking have been presented: the 
older view that logic is the science of 
thought, and the position more widely 
held at present that its relation to 
thinking is normative only. It has 
been acknowledged that ordinary 
thinking, more often than not, does not 
assume the form of the Syllogism, and 
that errors in reasoning are frequent, 
But does error mean that the logical 
process has been violated? or can we 


91f it could be established generally that 
the laws of logic are those of the human 
mind, this would not be to give the former 
the status of “mere facts” or to make them 
relative to particular conditions; rather it 
Would be to recognize the requiredness in 
thinking (cf, Kohler, 1938, especially pp. 
45, 52) 
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account for it in other ways? Evi- 
dence has been presented that suggests 
that even where the thinking process 
results in error, it can often be shown 
that it does not violate the rules of the 
syllogism. Many errors were found to 
be accounted for not in terms of a 
breakdown of the deductive process 
itself, but rather in terms of changes 
in the material from which the reason- 
ing proceeds. The problem of the role 
of logical processes in everyday life 
was discussed more generally, and 
some implications of this study for 
other problems were considered. 
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The problem of which cues, internal 
or external, permit a person to label 
and identify his own emotional state 
has been with us since the days that 
.James (1890) first tendered his doc- 
trine that "the bodily changes follow 
directly the perception of the exciting 
fact, and that our feeling of the same 
changes as they occur is the emotion" 
(p. 449). Since we are aware of a 
variety of feeling and emotion states, it 
should follow from James' proposition 
that the various emotions will be ac- 
companied by a variety of differentiable 
bodily states. Following James' pro- 
nouncement, a formidable number of 
studies were undertaken in search of 
the physiological differentiators of the 
emotions. The results, in these early 
days, were almost uniformly negative. 
All of the emotional states experi- 


1 This experiment is part of a program of 
research on cognitive and physiological de- 
terminants of emotional state which is being 
conducted at the Department of Social Psy- 
chology at Columbia University under PHS 
Research Grant M-2584 from the National 
Institute of Mental Health, United States 
Public Health Service. This experiment was 
conducted at the Laboratory for Research in 
Social Relations at the University of Minne- 
sota. 

The authors wish to thank Jean Carlin and 
Ruth Hase, the physicians in the study, and 
Bibb Latané and Leonard Weller who were 
the paid participants. 
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mentally manipulated were character- 
ized by a general pattern of excitation 
of the sympathetic nervous system but 
there appeared to be no clear-cut physi- 
ological discriminators of the various 
emotions. This pattern of results was 
so consistent from experiment to ex- 
periment that Cannon (1929) offered, 
as one of the crucial criticisms of the 
James-Lange theory, the fact that “the 
same visceral changes occur in very dif- 
ferent emotional states and in non-emo- 
tional states” (p. 351). 

More recent work, however, has 
given some indication that there may 
be differentiators. Ax (1953) and 
Schachter (1957) studied fear and 
anger. On a large number of indices 
both of these states were characterized 
by a similarly high level of autonomic 
activation but on several indices they 
did differ in the degree of activation. 
Wolf and Wolff (1947) studied a sub- 
ject with a gastric fistula and were able 
to distinguish two patterns in the phys- 
iological responses of the stomach wall. 
It should be noted, though, that for 
many months they studied their subject 
during and following a great variety of 
moods and emotions and were able to 
distinguish only two patterns. 

Whether or not there are physiologi- 
cal distinctions among the various emo- 
tional states must be considered an open 
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question. Recent work might be taken 
to indicate that such differences are at 
best rather subtle and that the variety 
of emotion, mood, and feeling states are 
by no means matched by an equal vari- 
ety of visceral patterns. 

This rather ambiguous situation has 
led Ruckmick (1936), Hunt, Cole, and 
Reis (1958), Schachter (1959) and 
others to suggest that cognitive factors 
may be major determinants of emo- 
tional states. Granted a general pat- 
tern of sympathetic excitation as char- 
acteristic of emotional states, granted 
that there may be some differences in 
pattern from state to state, it is sug- 
gested that one labels, interprets, and 
identifies this stirred-up state in terms 
of the characteristics of the precipi- 
tating situation and one's apperceptive 
mass. This suggests, then, that an 
emotional state may be considered a 
function of a state of physiological 
arousal * and of a cognition appropriate 
to this state of arousal. The cognition, 
in a sense, exerts a Steering function. 
Cognitions arising from the immediate 
situation as interpreted by past experi- 
ence provide the framework within 
which one understands and labels his 
feelings. It is the cognition which de- 
termines whether the state of physio- 
logical arousal will be labeled as 
“anger,” “joy,” “fear,” or whatever, 

In order to examine the implications 
of this formulation let us consider the 
fashion in which these two elements, 
a state of physiological arousal and cog- 
nitive factors, would interact in a vari- 
ety of situations. In most emotion in- 
ducing situations, of course, the two 

2 Though our experiments are concerned 
exclusively with the „physiological changes 
produced by the injection of adrenalin, which 

appear to be primarily the result o E 
pathetic excitation, the term Hone logical 
arousal is used in preference to the more 
specific “excitation of the sympatien ne 
tem” because there are indications, 
UT s Mscnssed later, that this formulation 
te Pican to a variety of bodily states. 
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factors are completely interrelated. 
Imagine a man walking alone down a 
dark alley, a figure with a gun sud- 
denly appears. The perception-cogni- 
tion “figure with a gun” in some fash- 
ion initiates a state of physiological 
arousal; this state of arousal is inter- 
preted in terms of knowledge about 
dark alleys and guns and the state of 
arousal is labeled “fear.” Similarly a 
student who unexpectedly learns that 
he has made Phi Beta Kappa may ex- 
perience a state of arousal which he will 
label “joy,” 

Let us now consider circumstances in 
which these two elements, the physio- 
logical and the cognitive, are, to some 
extent, independent. First, is the state 
of physiological arousal alone sufficient 
to induce an emotion? Best evidence 
indicates that it is not. Marañon 
(1924), ina fascinating study, (which 
was replicated by Cantril & Hunt, 
1932, and Landis & Hunt, 1932) in- 
jected 210 of his patients with the sym- 
pathomimetic agent adrenalin and then 
simply asked them to introspect. Sev- 
enty-one percent of his subjects simply 
reported their physical symptoms with 
no emotional overtones; 29% of the 
subjects responded in an apparently 
emotional fashion. Of these the great 
majority described their feelings in a 
fashion that Marañon labeled “cold” or 
"as if" emotions, that is, they made 
statements such as "T feel as if I were 
afraid” or “as if I were awaiting a 
great happiness.” This is a sort of 
emotional "déjà vu” experience ; these 
subjects are neither happy nor afraid, 
they feel “as if” they were. Finally 2 
very few cases apparently reported à 
genuine emotional experience. How- 
ever, in order to produce this reaction 
in most of these few cases, Marañon 
(1924) points out: 


? Translated copies of Marafion’s 1920) 
Paper may be obtained by writing to th 
senior author. 
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One must suggest a memory with strong 
affective force but not so strong as to pro- 
duce an emotion in the normal state. For ex- 
ample, in several cases we spoke to our pa- 
tients before the injection of their sick 
children or dead parents and they responded 
calmly to this topic. The same topic pre- 
sented later, during the adrenal commotion, 
was sufficient to trigger emotion. This ad- 
renal commotion places the subject in a situ- 
ation of ‘affective imminence’ (pp. 307-308). 


Apparently, then, to produce a genu- 
inely emotional reaction to adrenalin, 
Marañon was forced to provide such 
subjects with an appropriate cognition. 

Though Marañon (1924) is not ex- 
plicit on his procedure, it is clear that 
his subjects knew that they were re- 
ceiving an injection and in all likeli- 
hood knew that they were receiving 
adrenalin and probably had some order 
of familiarity with its effects, In short, 
though they underwent the pattern of 
sympathetic discharge common to 
strong emotional states, at the same 
time they had a completely appropriate 
cognition or explanation as to why 
they felt this way. This, we would 
suggest, is the reason so few of Mara- 
fion's subjects reported any emotional 
experience. 

Consider now a person in a state of 
physiological arousal for which no im- 
mediately explanatory or appropriate 
cognitions are available. Such a state 
could result were one covertly to inject 
a subject with adrenalin or, unknown 
to him, feed the subject a sympathomi- 
metic drug such as ephedrine. Under 
such conditions a subject would be 
aware of palpitations, tremor, face 
flushing, and most of the battery of 
symptoms associated with a discharge 
of the sympathetic nervous system. In 
contrast to Marafion’s (1924) subjects 
he would, at the same time, be utterly 
unaware of why he felt this way. What 
would be the consequence of such a 
state? 

Schachter (1959) has suggested that 
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precisely such a state would lead to the 
arousal of “evaluative needs” (Festin- 
ger, 1954), that is, pressures would act 
on an individual in such a state to un- 
derstand and label his bodily feelings. 
His bodily state grossly resembles the 
condition in which it has been at times 
of emotional excitement. How would 
he label his present feelings? It is sug- 
gested, of course, that he will label his 
feelings in terms of his knowledge of 
the immediate situation Should he at 
the time be with a beautiful woman he 
might decide that he was wildly in love 
or sexually excited. Should he be at 
a gay party, he might, by comparing 
himself to others, decide that he was 
extremely happy and euphoric. Should 
he be arguing with his wife, he might 
explode in fury and hatred. Or, should 
the situation be completely inappropri- 
ate he could decide that he was excited 
about something that had recently hap- 
pened to him or, simply, that he was 
sick, In any case, it is our basic as- 
sumption that emotional states are a 
function of the interaction of such cog- 
nitive factors with a state of physiologi- 
cal arousal. 

This line of thought, then, leads to 
the following propositions: 

1. Given a state of physiological 
arousal for which an individual has no 
immediate explanation, he will “label” 
this state and describe his feelings in 
terms of the cognitions available to him. 
To the extent that cognitive factors are 
potent determiners of emotional states, 
it could be anticipated that precisely the 
same state of physiological arousal 
could be labeled “joy” or “fury” or 
"jealousy" or any of a great diversity 


4 This suggestion is not new for several 
psychologists have suggested that situational 
factors should be considered the chief differ- 
entiators of the emotions. Hunt, Cole, and 
Reis (1958) probably make this point most 
explicitly in their study distinguishing among 
fear, anger, and sorrow in terms of situ- 
ational characteristics. 
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of emotional labels depending on the 
cognitive aspects of the situation. , 

2. Given a state of physiological 
arousal for which an individual has a 
completely appropriate explanation 
(e.g., "I feel this way because I have 
just received an injection of adren- 
alin") no evaluative needs will arise 
and the individual is unlikely to label 
his feelings in terms of the alternative 
cognitions available. 

Finally, consider a condition in 
which emotion inducing cognitions are 
present but there is no state of physio- 
logical arousal. For example, an indi- 
vidual might be completely aware that 
he is in great danger but for some 
reason (drug or surgical) remain in a 
state of physiological quiescence. Does 
he experience the emotion "fear"? 
Our formulation of emotion as a joint 
function of a state of physiological 
arousal and an appropriate cognition, 
would, of course, suggest that he does 
not, which leads to our final proposi- 
tion. 

3. Given the same cognitive circum- 
stances, the individual will react emo- 
tionally or describe his feelings as emo- 
tions only to the extent that he 
experiences a state of physiological 
arousal.* 

PROCEDURE 


The experimental test of these propositions 
requires (a) the experimental manipulation 
of a state of physiological arousal, (b) the 
manipulation of the extent to which the sub- 
ject has an appropriate or proper explanation 
of his bodily state, and (c) the creation of 
situations from which explanatory cognitions 
may be derived. — 

In order to satisfy the first two experi- 
mental requirements, the experiment was cast 

5 In his critique of the James-Lange theory 
of emotion, Cannon (1929) also makes the 
point that sympathectomized animals an pa- 
tients do seem to manifest emotanal be- 
havior. This criticism is, of Ww as = 
plicable to the above proposition as à w ario 

James-Lange formulation. We sha 
> s the issues involved in later papers. 
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in the framework of a study of the effects of 
vitamin supplements on vision. As soon as 2 
subject arrived, he was taken to a pri 
room and told by the experimenter : 


In this experiment we would like to 
make various tests of your vision. We are 
particularly interested in how certain vita- 
min compounds and vitamin supplements 
affect the visual skills. In particular, we 
want to find out how the vitamin compound 
called ‘Suproxin’ affects your vision. — 

What we would like to do, then, if we 
Can get your permission, is to give you a 
small injection of Suproxin. The injec- 
tion itself is mild and harmless; however, 
Since some people do object to being in- 
jected we don't want to talk you into any- 
thing. Would you mind receiving a 
Suproxin injection? 


If the subject agrees to the injection (and 
all but 1 of 185 subjects did) the experi- 
menter continues with instructions we shall 
describe shortly, then leaves the room. In a 
Íew minutes a physician enters the room, 
briefly repeats the experimenter's instruc- 
tions, takes the subject's pulse and then in- 
jects him with Suproxin. 

Depending upon condition, the subject re- 
ceives one of two forms of Suproxin— 
epinephrine or a placebo. " 

Epinephrine or adrenalin is a sympathomi- 
metic drug whose effects, with minor excep- 
tions, are almost a perfect mimicry of @ 
discharge of the sympathetic nervous system. 
Shortly after injection systolic blood pressure 
increases markedly, heart rate increases 
somewhat, cutaneous blood flow decreases, 
while muscle and cerebral blood flow in- 
crease, blood sugar and lactic acid concentra- 
tion increase, and respiration rate increases 
slightly. As far as the subject is concerned 
the major subjective symptoms are palpita- 
tion, tremor, and sometimes a feeling of 
flushing and accelerated breathing, With a 
subcutaneous injection (in the dosage ad- 
ministered to our subjects), such effects usu- 
ally begin within 3-5 minutes of injection gne 
last anywhere from 10 minutes to an hour. 
For most subjects these effects are dissipate“ 
within 15-20 minutes after injection. f 

Subjects receiving epinephrine received p 
subcutaneous injection of % cubic centimete! 
of a 1:1000 solution of Winthrop Labora” 
tory’s Suprarenin, a saline solution © 
epinephrine bitartrate. , 

Subjects in the placebo condition received £ 
subcutaneous injection of 14 cubic centimete" 
of saline solution. This is, of course, com 
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pletely neutral material with no side effects 
at all. 


Manipulating an Appropriate 
Explanation 


By "appropriate" we refer to the extent to 
which the subject has an authoritative, un- 
equivocal explanation of his bodily condi- 
tion. Thus, a subject who had been informed 
by the physician that as a direct consequence 
of the injection he would feel palpitations, 
tremor, ete. would be considered to have a 
completely appropriate explanation. A sub- 
ject who had been informed only that the 
injection would have no side effects would 
have no appropriate explanation of his state. 
'This dimension of appropriateness was ma- 
nipulated in three experimental conditions 
which shall be called: Epinephrine Informed 
(Epi Inf), Epinephrine Ignorant (Epi Ign), 
and Epinephrine Misinformed (Epi Mis). 

Immediately after the subject had agreed 
to the injection and before the physician en- 
tered the room, the experimenter's spiel in 
each of these conditions went as follows: 


Epinephrine Informed. 1 should also tell 
you that some of our subjects have ex- 
perienced side effects from the Supros 
These side effects are transitory, that is, 
they will only last for about 15 or 20 min- 
utes. What will probably happen is that 
your hand will start to shake, your heart 
will start to pound, and your face may get 
warm and flushed. Again these are side 
effects lasting about 15 or 20 minutes. 


3 While the physician was giving the injec- 
tion, she told the subject that the injection 
was mild and harmless and repeated this de- 
Scription of the symptoms that the subject 
could expect as a consequence of the shot. In 
this condition, then, subjects have a com- 
pletely appropriate explanation of their bodily 
state. They know precisely what they will 
feel and why. i, 

Epinephrine Ignorant. In this condition 
when the subject agreed to the injection, the 
experimenter said nothing more relevant to 
side effects and simply left the room. While 
the physician was giving the injection, she 
told the subject that the injection was mild 
and harmless and would have no side effects, 
In this condition, then, the subject has no ex- 
perimentally provided explanation for his 
bodily state. 


Epinephrine Misinformed. I should also 
tell you that some of our subjects have e 
pevienced side effects from the Suproxin. 


These side effects are transitory, that is, 
they will only last for about 15 or 20 
minutes. What will probably happen is 
that your feet will feel numb, you will have 
an itching sensation over parts of your 
body, and you may get a slight headache. 
Again these are side effects lasting 15 or 
20 minutes. 


And again, the physician repeated these 
ymptoms while injecting the subject. 

None of these symptoms, of course, are 
consequences of an injection of epinephrine 
and, in effect, these instructions provide the 
subject with a completely inappropriate ex- 
planation of his bodily feclings. This condi- 
tion was introduced as a control condition of 
sorts. It seemed possible that the descrip- 
tion of side effects in the Epi Inf condition 
might turn the subject introspective, self-ex- 
amining, possibly slightly troubled. Differ- 
ences on the dependent variable between the 
Epi Inf and Epi Ign conditions might, then, 
be due to such factors rather than to dií- 
ferences in appropriateness. The false symp- 
toms in the Epi Mis condition should simi- 
larly turn the subject introspective, etc., but 
the instructions in this condition do not pro- 
vide an appropriate explanation of the sub- 
ject's state. 

Subjects in all of the above conditions were 
injected with epinephrine. Finally, there was 
a placebo condition in which subjects, who 
were injected with saline solution, were given 
precisely the same treatment as subjects in 
the Epi Ign condition. 


Producing an Emotion Inducing 
Cognition 


Our initial hypothesis has suggested that 
given a state of physiological arousal for 
which the individual has no adequate ex- 
planation, cognitive factors can lead the in- 
dividual to describe his feelings with any of 
a diversity of emotional labels. In order to 
test this hypothesis, it was decided to manipu- 
late emotional states which can be considered 
quite different—euphoria and anger. 

There are, of course, many ways to induce 
such states. In our own program of re- 
search, we have concentrated on social de- 
terminants of emotional states and have been 
able to demonstrate in other studies that 
people do evaluate their own feelings by com- 
paring themselves with others around them 
(Schachter 1959; Wrightsman 1960). In 
this experiment we have attempted again to 
manipulate emotional state by social means. 
In one set of conditions, the subject is placed 
together with a stooge who has been trained 
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to act euphorically. In a second set of con- 
ditions the subject is with a stooge trained 
to act in an angry fashion. 


Euphoria. 

Immediately 9 after the subject had been 
injected, the physician left the room and the 
experimenter returned with a stooge whom 
he introduced as another subject, then said: 


Both of you have had the Suproxin shot 
and you'll both be taking the same tests of 
vision. What I ask you to do now is just 
wait for 20 minutes. The reason for this 
is simply that we have to allow 20 minutes 
for the Suproxin to get from the injection 
site into the bloodstream. At the end of 
20 minutes when we are certain that most 
of the Suproxin has been absorbed into 
the bloodstream, we'll begin the tests of 
vision, 


The room in which this was said had been 
deliberately put into a state of mild disarray, 
As he was leaving, the experimenter apolo- 
getically added: 


The only other thing I should do is to 
apologize for the condition of the room, 
I just didn’t have time to clean it up. So, 
if you need any scratch paper or rubber 
bands or Pencils, help yourself. I'll be 
back in 20 minutes to begin the vision tests. 


As soon as the experimenter had left, the 
stooge introduced himself again, made a 
series of standard icebreaker comments, and 
then launched his routine, For Observation 
purposes, the stooge's act was broken into a 
series of standard units, demarcated by a 
change in activity or a standard comment. 
In sequence, the units of the stooge’s routine 
were the following: 


1. Stooge reaches for a piece of paper 
and starts doodling saying, "They said we 
could use this for scratch, didn't they?” 
He doodles a fish for some 30 seconds, then 


says: 


6 Tt was, of course, imperative that the se- 
quence with the stooge begin before the sub- 
ject felt his first symptoms for otherwise the 
subject would be virtually forced to interpret 
his feelings in terms of events preceding. the 
stooge's entrance. Pretests 1 had t indicated 
that, for most subjects, eppi i 
symptoms began within 3-5 minutes ee 
is ction. A deliberate attempt was made then 
OMS in the stooge within 1 minute after 
the subject's injection. 


2. "This scrap paper isn't even much 
good for doodling" and crumples paper and 
attempts to throw it into wastebasket in far 
Corner of the room. He misses but this 
leads him into a "basketball game" He 
crumples up other sheets of paper, shoots a 
few baskets, says “Two points" occasion- 
ally. He gets up and does a jump shot 
saying, "The old jump shot is really on 
today." 

3. If the subject has not joined in, the 
stooge throws a paper basketball to the 
subject saying, "Here, you try it." ` 
4. Stooge continues his game saying, 
The trouble with paper basketballs is that 
you don’t really have any control.” 

5. Stooge continues basketball, then 
gives it up saying, “This is one of my good 
days. I feel like a kid again. I think I'll 
make a plane.” He makes a paper airplane 
saying, "I guess I'll make one of the 
longer ones.” 

6. Stooge flies plane. Gets up and re- 
trieves plane, Flies again, etc. 

7. Stooge throws plane at subject. 

8. Stooge, flying plane, says, “Even when 
I was a kid, I was never much good at 
this." 2 

9. Stooge tears off part of plane saying, 
"Maybe this plane can't fly but at least 
it’s good for something.” He wads up 
Paper and making a slingshot of a rubber 
band begins to shoot the paper, 

10. Shooting, the stooge says, “They 
[paper ammunition] really go better if you 
make them long, They don't work right 
if you wad them up." 

11. While shooting, stooge notices a 
sloppy pile of manila folders on a table. 
He builds a tower of these folders, then 
goes to the opposite end of the room to 
shoot at the tower, 5 

12. He misses Several times, then hits 
and cheers as the tower falls; He goes 
over to pick up the folders, " 

13. While picking up, he notices, behind 
à portable blackboard, a pair of hula hoops 


"which have been covered with black tape 


with a few wires sticking out of the tape. 
He reaches for these, taking one for him- 
self and putting the other aside but within 
Teaching distance of the subject. The 
stooge tries the hula hoop, saying, "This 
isn't as easy as it looks." 

14. Stooge twirls hoop wildly on army 
saying, "Hey, look at this—this is great. 

15. Stooge replaces the hula hoop an 
Sits down with his feet on the table. 
Shortly thereafter the experimenter Te 
turns to the room. 


— 
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This routine was completely standard, 
though its pace, of course, varied depending 
upon the subject's reaction, the extent to 
which he entered into this bedlam and the 
extent to which he initiated activities of his 
own. The only variations from this standard 
routine were those forced by the subject. 
Should the subject originate some nonsense 
of his own and request the stooge to join in, 
he would do so. And, he would, of course, 
respond to any comments initiated by the 
subject. 

Subjects in each of the three "appropriate- 
ness" conditions and in the placebo condi- 
tion were submitted to this setup. The 
stooge, of course, never knew in which con- 
dition any particular subject fell, 


Anger 


Immediately after the injection, the ex- 
perimenter brought a stooge into the subject’s 
room, introduced the two and after explain- 
ing the necessity for a 20 minute delay for 
the Suproxin to get from the injection site 
into the bloodstream” he continued, “We 
would like you to use these 20 minutes to 
answer these questionnaires.” Then handing 
out the questionnaires, he concludes with, 
"I'll be back in 20 minutes to pick up the 
questionnaires and begin the tests of vision." 

Before looking at the questionnaire, the 
Stooge says to the subject, 


I really wanted to come for an experi- 
ment today, but I think it's unfair for them 
to give you shots. At least, they should 
have told us about the shots when they 
called us; you hate to refuse, once you're 
here already. 


The questionnaires, five pages long, start 
off innocently requesting face sheet informa- 
tion and then grow increasingly personal and 
insulting. The stooge, sitting directly op- 
posite the subject, paces his own answers so 
that at all times subject and stooge are work- 
ing on the same question. At regular points 
in the questionnaire, the stooge makes a 
series of standardized comments about the 
questions. His comments start off innocently 
enough, grow increasingly querulous, and 
finally he ends up in a rage. In sequence, 
he makes the following comments. 


1. Before answering any items, he leafs 
quickly through the questionnaire saying, 
“Boy, this is a long one.” f i 

2. Question 7 on the questionnaire re- 
quests, “List the foods that you would eat 
in a typical day.” The stooge comments, 


“Oh for Pete’s sake, what did I have for 
breakfast this morning ?” 

3. Question 9 asks, “Do you ever hear 
bells? . How often? — 
The stooge remarks, "Look at Question 9. 
How ridiculous can you get? I hear bells 
every time I change classes." £ 

4. Question 13 requests, “List the child- 
hood diseases you have had and the age 
at which you had them” to which the 
stooge remarks, “I get annoyed at this 
childhood disease question. I can't remem- 
ber what childhood diseases I had, and 
especially at what age. Can you?" 

5. Question 17 asks “What is your fath- 
er's average annual income?" and the 
stooge says, "This really irritates me. It’s 
none of their business what my father 
makes. I’m leaving that blank.” 

6. Question 25 presents a long series of 
items such as “Does not bathe or wash reg- 
ularly,” “Seems to need psychiatric care,” 
etc. and requests the respondent to write 
down for which member of his immediate 
family each item seems most applicable. 
The question specifically prohibits the an- 
swer “None” and each item must be an- 
swered. The stooge says, "I'll be damned if 
I'll fill out Number 25. ‘Does not bathe or 
wash regularly'Tthat's a real insult.” He 
then angrily crosses out the entire item. 

7. Question 28 reads : 

"How many times each week do you 
have sexual intercourse?" 0-1 2-3 
4-6 7 and over The 
stooge bites out, “The hell with it! I 
don't have to tell them all this." 

8. The stooge sits sullenly for a few mo- 
ments then he rips up his questionnaire, 
crumples the pieces and hurls them to the 
floor, saying, “I’m not wasting any more 
time. I'm getting my books and leaving" 
and he stamps out of the room. 

9. The questionnaire continues for eight 
more questions ending with: "With how 
many men (other than your father) has 
your mother had extramarital relation- 
ships?" 

4 and under 
over 


+ 5-9: — 10 sand 


Subjects in the Epi Ign, Epi Inf and 
Placebo conditions were run through this 
"anger" inducing sequence. The stooge, 
again, did not know to which condition the 
subject had been assigned. 

In summary, this is a seven condition ex- 
periment which, for two different emotional 
states, allows us (a) to evaluate the effects 
of “appropriateness” on emotional inducibility 
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and (b) to begin to evaluate the effects of 
sympathetic activation on emotional induci- 
bility. In schematic form the conditions are 
the following: 


EUPHORIA ANGER 
Epi Inf Epi Inf 
Epi Ign Epi Ign 
Epi Mis Placebo 


Placebo | 


The Epi Mis condition was not run in the 
Anger sequence. "This was originally con- 
ceived as a control condition and it was felt 
that its inclusion in the Euphoria conditions 
alone would suffice as a means of evaluating 
the possible artifactual effect of the Epi Inf 
instructions. 


Measurement 


Two types of measures of emotional state 
were obtained. Standardized observation 
through a One-way mirror was the technique 
used to assess the subject's behavior. To 
what extent did he act euphoric or angry? 
Such behavior can be considered in a way as 
a “semiprivate” index of mood for as far as 
the subject was concerned, his emotional be- 
havior could be known only to the other per- 
son in the Toom—presumably another stu- 
dent. The second type of measure was selí- 
Teport in which, on a variety of scales, the 
subject indicated his mood of the moment, 
Such measures can be considered “public” 
indices of mood for they would, of course, 
be available to the experimenter and his 
associates, 


Observation 


Euphoria. For each of the first 14 units 
of the stooge's standardized routine an ob- 
server kept a running chronicle of what the 
subject did and said. „For each unit the ob- 
server coded the subjects behavior in one 
or more of the following categories : 

Category 1: Joins in activity. li the sub- 
ject entered into the stooge's activities, e.g., 
if he made or flew airplanes, threw paper 
basketballs, hula hooped, etc. his behavior 

a led in this category. 2. 

S eee 2: Initiates new activity. A sub- 
ject was so coded if he gave indications of 
creative euphoria, that is, if, on his own, he 
initiated behavior outside of the stooge’s rou- 
tine. Instances of such behavior would vi 
the subject who threw open the window and, 
laughing, hurled paper basketballs at E 
ersby ; or, the subject who jumped opem o 
and spun one m hoop on his leg anc 

cu E and 4: Ignores or watches 
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stooge. Subjects who paid flatly no atten- 
tion to the stooge or who, with or without 
comment, simply watched the stooge without 
joining in his activity were coded in these 
categories, 

For any particular unit of behavior, the 
subject’s behavior was coded in one or more 
of these categories, To test reliability of 
coding two observers independently coded 
two experimental sessions. The observers 
agreed completely on the coding of 88% 
of the units, 

Anger. For each of the units of stooge 
behavior, an observer recorded the subject's 
Tesponses and coded them according to the 
following category scheme: 

Category 1: Agrees. In response to the 
stooge the subject makes a comment indi- 
cating that he agrees with the stooge's stand- 
ardized comment or that he, too, is irked by 
a particular item on the questionnaire. For 
example, a subject who responded to the 
stooge’s comment on the "father's income 
question by saying, "T don't like that kind 
of personal question either" would be so 
coded (scored :E2).. 

Category 2: Disagrees, In response to the 
Stooge's comment, the subject makes a com- 
ment which indicates that he disagrees with 
the stooge's meaning or mood; eg, in re- 
sponse to the stooge’s comment on the “fath- 
er's income" question, such a subject might 
say, "Take it easy, they probably have a 
good reason for wanting the information” 
(scored —2). 

Category 3: Neutral. A noncommittal or 
irrelevant Tesponse to the stooge's remark 
(scored 0). 

Category 4: Initiates agreement or dis- 
agreement. With no instigation by the 
stooge, a subject, so coded, would have 
volunteered a remark indicating that he felt 
the same way or, alternatively, quite differ- 
ently than the stooge. Examples would be 
“Boy I hate this kind of thing" or “I’m en- 
Joying this" (scored +2 or —2). 

Category 5: Watches. The subject makes 
no verbal response to the stooge’s comment 
but simply looks directly at him (scored 0)- 

Category 6: Ignores. The subject makes 
no verbal response to the stooge’s comment 
nor does he look at him; the subject, paying 
no attention at all to the stooge, simply works 
at his own questionnaire (scored =). 

A subject was scored in one or more of 
these categories for each unit of stooge be- 
havior. To test reliability, two observers in- 
dependently coded three experimental ses- 
sions. In order to get a behavioral index 
of anger, observation Protocol was scored ač- 
cording to the values Presented in parentheses 


i 
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after each of the above definitions of cate- 
gories. In a unit-by-unit comparison, the two 
Observers agreed completely on the scoring 
of 7196 of the units jointly observed. The 
scores of the two observers differed by a 
value of 1 or less for 889 of the units coded 
and in not a single case did the two observers 
differ in the direction of their scoring of 
a unit. 


Self Report of Mood and Physical 
Condition 


When the subject's session with the stooge 
was completed, the experimenter returned to 
the room, took pulses and said: 


Before we proceed with the vision tests, 
there is one other kind of information 
which we must have. We have found, as 
you can probably imagine, that there are 
many things beside Suproxin that affect 
how well you see in our tests. How 
hungry you are, how tired you are, and 
even the mood you're in at the time— 
whether you feel happy or irritated at the 
time of testing will affect how well you 
see. To understand the data we collect on 
you, then, we must be able to figure out 
Which effects are due to causes such as 
these and which are caused by Suproxin. 

i The only way we can get such informa- 

tion about your physical and emotional 
state is to have you tell us. I'll hand out 
these questionnaires and ask you to an- 
swer them as accurately as possible. Ob- 
viously, our data on the vision tests will 
only be as accurate as your description of 
your mental and physical state. 


In keeping with this spiel, the questionnaire 
that the experimenter passed out contained a 
number of mock questions about hunger, 
fatigue, etc, as well as questions of more 
immediate relevance to the experiment. To 
measure mood or emotional state the follow- 
ing two were the crucial questions: 


1. How irritated, angry or annoyed would 
you say you feel at present? 
| l I 


font I feel I feel If 
feelatall alittle — quite very extremely 
irritated irritated irritated irritated irritated 
or angry andangry and angry and angry and angry 
toy Q) (4) 


I feel 


2. How good or happy would you say you 
feel at present? 


| | ! ! l 


I feel T feel I feel 


T don't I feel 
feel atall — a little quite very — extremely 
happy. happy happy happy happy 
or good and good and good and good and good 
(0) a) Q) G) (4) 


To measure the physical effects of epi- 
nephrine and determine whether or not the 
injection had been successful in producing the 
necessary bodily state, the following questions 
were asked: 


l | li — 


Not at A slight A moderate An intense 
all amount amount amount 
(0) a) (2) (3) 


2. Did you feel any tremor (involuntary 
shaking of the hands, arms or legs)? 


l | | 


Not at A slight A moderate An intense 
a amount amount amount 
(0) a) (2) (3) 


To measure possible effects of the instruc- 
tions in the Epi Mis condition, the following 
questions were asked : 

1. Did you feel any numbness in your feet? 

2. Did you feel any itching sensation? 

3. Did you experience any feeling of head- 
ache? 

To all three of these questions was at- 
tached a four-point scale running from "Not 
at all” to “An intense amount.” 

In addition to these scales, the subjects 
were asked to answer two open-end questions 
on other physical or emotional sensations 
they may have experienced during the experi- 
mental session. A final measure of bodily 
state was pulse rate which was taken by the 
physician or the experimenter at two times— 
immediately before the injection and immedi- 
ately after the session with the stooge. 

When the subjects had completed these 
questionnaires, the experimenter announced 
that the experiment was over, explained 
the deception and its necessity in detail, an- 
swered any questions, and swore the subjects 
to secrecy, Finally, the subjects answered a 
brief questionnaire about their experiences, if 
any, with adrenalin and their previous knowl- 
edge or suspicion of the experimental setup. 
There was no indication that any of the sub- 
jects had known about the experiment before- 
hand but 11 subjects were so extremely sus- 
picious of some crucial feature of the 
experiment that their data were automatically 
discarded. 


Subjects 


The subjects were all male, college stu- 
dents taking classes in introductory psychol- 
ogy at the University of Minnesota. Some 
90% of the students in these classes volunteer 
for a subject pool for which they receive two 
extra points on their final exam for every 
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hour that they serve as experimental sub- 
jects. For this study the records of all po- 
tential subjects were cleared with the Student 
Health Service in order to insure that no 
harmful effects would result from the 
injections. 


Evaluation of the Experimental Design 


The ideal test of our propositions would 
require circumstances which our experiment 
is far from realizing. First, the proposition 
that: "A state of physiological arousal for 
which an individual has no immediate ex- 
planation will lead him to label this state in 
terms of the cognitions available to him” 
obviously requires conditions under which 
the subject does not and cannot have a proper 
explanation of his bodily state. Though we 
toyed with such fantasies as ventilating the 
experimental room with vaporized adrenalin, 
reality forced us to rely on the disguised in- 
jection of Suproxin—a technique which was 
far from ideal for no matter what the ex- 
perimenter told them, some subjects would 
inevitably attribute their feelings to the in- 
jection. To the extent that subjects did so, 
differences between the several appropriate- 
ness conditions should be attenuated. 

Second, the proposition that: “Given the 
same cognitive circumstances the individual 
will react emotionally only to the extent that 
he experiences a state of physiological 
arousal” requires for its ideal test the ma- 
nipulation of states of Physiological arousal 
and of physiological quiescence, Though 
there is no question that epinephrine effec- 
tively produces a state of arousal, there is 
also no question that a placebo does not pre- 
vent physiological arousal. To the extent 
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that the experimental situation effectively 
produces sympathetic stimulation in placebo 
subjects, the proposition is difficult to test, for 
such a factor would attenuate differences be- 
tween epinephrine and placebo subjects. 

Both of these factors, then, can be expected 
to interfere with the test of our several 
Propositions. In presenting the results of 
this study, we shall first present condition by 
condition results and then evaluate the effect 
of these two factors on experimental differ- 
ences. 


RESULTS 


Effects of the Injections on Bodily 
State 


Let us examine first the success of 
the injections at producing the bodily 
state required to examine the proposi- 
tions at test. Does the injection. of 
epinephrine produce symptoms of sym- 
pathetic discharge as compared with 
the placebo injection? Relevant data 
are presented in Table 1 where it can 
be immediately seen that on all items 
subjects who were in epinephrine con- 
ditions show considerably more evi- 
dence of sympathetic activation than do 
subjects in placebo conditions. In all 
epinephrine conditions pulse rate in- 
creases significantly when compared 
with the decrease characteristic of the 
placebo conditions, On the scales it is 
clear that epinephrine subjects experi- 


TABLE 1 
THE EFFECTS OF THE INJECTIONS ON BoniLy Stati 


Pulse Self-rating of 
Condition N 
Pre Post Palpitation | Tremor Numbness | Itching | Headache 
Suphoria 
n Inf | 27 85.7 88.6 1.20 1.43 0 0.16 0.32 
Epilgn | 26 84.6 85.6 1.83 1.76 0.15 0 0.55 
Epi Mis | 26 82.9 86.0 1.27 2.00 0.06 0.08 0.23 
Placebo | 26 | 804 774A 0.29 0.21 0.09 0 0.27 
Anger 4 1.26 0.11 
25i Inf 23 85.9 92. . 1.41 0.17 0 . 
Euh aj wo | ee 144 178 | 0 006 | 021 
Phcebo | 43 | 845 | 79.6 0.59 0.24 | 014 | 0.06 | 0.06 
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ence considerably more palpitation and 
tremor than do placebo subjects. In 
all possible comparisons on these symp- 
toms, the mean scores of subjects in 
any of the epinephrine conditions are 
greater than the corresponding scores 
in the placebo conditions at better than 
the .001 level of significance. Exami- 
nation of the absolute values of these 
scores makes it quite clear that subjects 
in epinephrine conditions were, indeed, 
in a state of physiological arousal, while 
most subjects in placebo conditions 
were in a relative state of physiological 
quiescence. 

The epinephrine injection, of course, 
did not work with equal effectiveness 
for all subjects; indeed for a few sub- 
jects it did not work at all. Such sub- 
jects reported almost no palpitation or 
tremor, showed no increase in pulse 
and described no other relevant phys- 
ical symptoms. Since for such subjects 
the necessary experimental conditions 
were not established, they were auto- 
matically excluded from the data and 
all further tabular presentations will not 
include such subjects. Table 1, how- 
ever, does include the data of these sub- 
jects. There were four such subjects 
in euphoria conditions and one of them 
in anger conditions. 

In order to evaluate further data on 
Epi Mis subjects it is necessary to note 
the results of the “numbness,” “itch- 
ing,” and “headache” scales also pre- 
sented in Table 1. Clearly the subjects 
in the Epi Mis condition do not differ 
on these scales from subjects in any of 
the other experimental conditions. 


Effects of the Manipulations on 
Emotional State 


Euphoria: Self-report. The effects 
of the several manipulations on emo- 
tional state in the euphoria conditions 
are presented in Table 2. The scores 
recorded in this table are derived, for 
each subject, by subtracting the value 
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of the point he checks on the irritation 
scale from the value of the point he 
checks on the happiness scale. Thus, 
if a subject were to check the point "I 
ieel a little irritated and angry” on the 
irritation scale and the point “I feel 
very happy and good” on the happiness 
scale, his score would be +2. The 
higher the positive value, the happier 
and better the subject reports himself 
as feeling. Though we employ an in- 
dex for expositional simplicity, it 
should be noted that the two com- 
ponents of the index each yield results 
completely consistent with those ob- 
tained by use of this index. 

Let us examine first the effects of the 
appropriateness instructions. Compari- 
son of the scores for the Epi Mis and 
Epi Inf conditions makes it immediately 
clear that the experimental differences 
are not due to artifacts resulting from 
the informed instructions. In both con- 
ditions the subject was warned to ex- 
pect a variety of symptoms as a conse- 
quence of the injection. In the Epi 
Mis condition, where the symptoms 
were inappropriate to the subject's 
bodily state the self-report score is al- 
most twice that in the Epi Inf condition 
where the symptoms were completely 
appropriate to the subject's bodily state. 
It is reasonable, then, to attribute dif- 
ferences between informed subjects 
and those in other conditions to dif- 
ferences in manipulated appropriate- 
ness rather than to artifacts such as 
introspectiveness or self-examination, 

It is clear that, consistent with expec- 
tations, subjects were more susceptible 
to the stooge's mood and consequently 
more euphoric when they had no ex- 
planation of their own bodily states 
than when they did. The means of 
both the Epi Ign and Epi Mis condi- 
tions are considerably greater than the 
mean of the Epi Inf condition. 

It is of interest to note that Epi Mis 
subjects are somewhat more euphoric 
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TABLE 2 


SELF-REPORT OF EMOTIONAL STATE IN 
THE EUPHORIA CONDITIONS 


" Self- 
Condi- Report Comparison p 
tion scales 
25 0.98 «.01 
25 1:78 -02 
D is | 25 1.90 ns 
Placebo | 26 | 1.61 Ign, or Inf 


All p values reported throughout paper are two- 
tailed. 


than are Epi Ign subjects. This pat- 
tern repeats itself in other data shortly 
to be presented. We would attribute 
this difference to differences in the ap- 
propriateness dimension. Though, as 
in the Epi Ign condition, a subject is 
not provided with an explanation of his 
bodily state, it is, of course, possible 
that he will provide one for himself 
which is not derived from his interac- 
tion with the stooge. Most reasonably 
he could decide for himself that he 
feels this way because of the injection. 
To the extent that he does so he should 
be less susceptible to the stooge. Tt 
seems probable that he would he less 
likely to hit on such an explanation in 
the Epi Mis condition than in the Epi 
Ign condition for in the Epi Mis condi- 
tion both the experimenter and the doc- 
tor have told him that the effects of the 
injection would be quite different from 
what he actually feels. The effect of 
such instructions is probably to make 
it more difficult for the subject himself 
to hit on the alternative explanation de- 
scribed above. There is some evidence 
to support this analysis. In open-end 
questions in which subjects described 
their own mood and state, 28% of the 
subjects in the Epi Ign condition made 
some connection between the injection 
and their bodily state compared. with 
the 16% of subjects in the Epi Mis 
condition who did so. It could be cons 
sidered, then, that these three condi- 
POER " limension of appropri- 
tions fall along a dime 
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ateness, with the Epi Inf condition at 
one extreme and the Epi Mis condition 
at the other. 

Comparing the placebo to the epi- 
nephrine conditions, we note a pattern 
which will repeat itself throughout the 
data. Placebo subjects are less cu- 
phoric than either Epi Mis or Epi Ign 
subjects but somewhat more euphoric 
than Epi Inf subjects. These differ- 
ences are not, however, statistically 
Significant. We shall consider the 
epinephrine-placebo comparisons in de- 
tail in a later section of this paper fol- 
lowing the presentation of additional 
relevant data. For the moment, it is 
clear that, by self-report manipulating 
appropriateness has had a very strong 
effect on euphoria. 

Behavior. Let us next examine the 
extent to which the subject’s behavior 
was affected by the experimental ma- 
nipulations. To the extent that his 
mood has been affected, one should ex- 
pect that the subject will join in the 
stooge’s whirl of manic activity and 
initiate similar activities of his own. 
The relevant data are presented in 
Table 3. The column labeled “Activity 


TABLE 3 


BEHAVIORAL INDICATIONS OF 


EMOTIONAL 


STATE IN THE EUPHORIA CONDITIONS 
bon Mean num- 
Conditi " Activit. “Dei acts 
ondition N tae oer a 2 
Epi Inf 25 12722 | .20 
Epi Ign 25 18.28 56 
Epi Mis 25 22.56 84 
Placebo 26 16.00 54 
p value 
Comparison Activity | Initiates 
M index 
Epi Inf vs. Epi Mis .05 03 
Zpi Inf vs. i ns .08 
ns Hs 


sect ested by X? comparison of the proportion of 
Jects in each condition initiating new acts. 


sub- 


ES "Las. —- 


Dr 


index" presents summary figures on the 
extent to which the subject joined in 
the stooge's activity. This is a 
weighted index which reflects both the 
nature of the activities in which the 
subject engaged and the amount of time 
he was active. The index was devised 
by assigning the following weights to 
the subject’s activities: 5—hula hoop- 
ing; 4—shooting with slingshot; 3— 
paper airplanes; 2— paper basketballs ; 
1—doodling ; 0—does nothing. Pre- 
test scaling on 15 college students 
ordered these activities with respect to 
the degree of euphoria they repre- 
sented. Arbitrary weights were as- 
signed so that the wilder the activity, 
the heavier the weight. These weights 
are multiplied by an estimate of the 
amount of time the subject spent in 
each activity and the summed products 
make up the activity index for each sub- 
ject. This index may be considered a 
measure of behavioral euphoria. It 
should be noted that the same between- 
condition relationships hold for the two 
components of this index as for the 
index itself, 

The column labeled *Mean number 
of acts initiated” presents the data on 
the extent to which the subject deviates 
from the stooge’s routine and initiates 
euphoric activities of his own. 

On both behavioral indices, we find 
precisely the same pattern of relation- 
ships as those obtained with self-re- 
ports. Epi Mis subjects behave some- 
what more euphorically than do Epi 
Ign subjects who in turn behave more 
euphorically than do Epi Inf subjects. 
On all measures, then, there is con- 
sistent evidence that a subject will take 
over the stooge’s euphoric mood to the 
extent that he has no other explana- 
tion of his bodily state. 

Again it should be noted that on 
these behavioral indices, Epi Ign and 
Epi Mis subjects are somewhat more 
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euphoric than placebo subjects but not 
significantly so. 

lager: Self-report. Before present- 
ing data for the anger conditions, one 
point must be made about the anger ma- 
nipulation. In the situation devised, 
anger, if manifested, is most likely to be 
directed at the experimenter and his 
annoyingly personal questionnaire. As 
we subsequently discovered, this was 
rather unfortunate, for the subjects, 
who had volunteered for the experiment 
for extra points on their final exam, 
simply refused to endanger these points 
by publicly blowing up, admitting their 
irritation to the experimenter's face or 
spoiling the questionnaire. Though as 
the reader will see, the subjects were 
quite willing to manifest anger when 
they were alone with the stooge, they 
hesitated to do so on material (self- 
ratings of mood and questionnaire) 
that the experimenter might see and 
only after the purposes of the experi- 
ment had been revealed were many of 
these subjects willing to admit to the 
experimenter that they had been irked 
or irritated. 

This experimentally unfortunate situ- 
ation pretty much forces us to rely on 
the behavioral indices derived from ob- 
servation of the subject's presumably 
private interaction with the stooge. 
We do, however, present data on the 
self-report scales in Table 4. These 
figures are derived in the same way as 
the figures presented in Table 2 for the 
euphoria conditions, that is, the value 
checked on the irritation scale is sub- 
tracted from the value checked on the 
happiness scale. "Though, for the rea- 
sons stated above, the absolute magni- 
tude of these figures (all positive) is 
relatively meaningless, we can, of 
course, compare condition means within 
the set of anger conditions. With the 
happiness-irritation index employed, 
we should, of course, anticipate pre- 
cisely the reverse results from those ob- 
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TABLE 4 


SELF-REPORT OF EMOTIONAL STATE IN 
THE ANGER CONDITIONS 


Self- | , 
Condition| N | Report Comparison | > 
K scales | 
Epi Inf |22 | 1.91 | Epi Inf vs. Epi | .08 
Ign a 
Epi Ign | 23 | 1.39 | Placebo vs. Epi | ns 
Ign or Inf 
Placebo | 23 | 1.63 | 


tained in the euphoria conditions; that 
is, the Epi Inf subjects in the anger 
conditions should again be less suscep- 
tible to the stooge's mood and should, 
therefore, describe themselves as in a 
somewhat happier frame of mind than 
subjects in the Epi Ign condition. 
This is the case; the Epi Inf subjects 
average 1.91 on the self-report scales 
while the Epi Ign subjects average 
1.39. 

Evaluating the effects of the injec- 
tions, we note again that, as antici- 
pated, Epi Ign subjects are somewhat 
less happy than Placebo subjects but, 
once more, this is not a significant 
difference. 

Behavior. The subject's responses 
to the stooge, during the period when 
both were filling out their question- 
naires, were systematically coded to 
provide a behavioral index of anger. 
The coding scheme and the numerical 
values attached to each of the categories 
have been described in the methodology 
section. To arrive at an “Anger in- 
dex” the numerical value assigned to 
a subject’s responses to the stooge is 
summed together. for the several units 
of stooge behavior. | In the coding 
scheme used, a positive value to this 
index indicates that the subject agrees 
with the stooge's comment and a a 

ing angry. A negative value in Pes 
that the subject either disagrees wi 


i him. 
stooge or ignores k 
s E ene data are presented in 
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Table 5. For this analysis, the stooge's 
routine has been divided into two 
phases—the first two units of his be- 
havior (the "long" questionnaire and 
"What did I have for breakfast?") are 
considered essentially neutral revealing 
nothing of the stooge's mood ; all of the 
following units are considered "angry" 
units for they begin with an irritated re- 
mark about the “bells” question and 
end with the stooge’s fury as he rips 
up his questionnaire and stomps out 
of the room. For the neutral units, 
agreement or disagreement with the 
stooge’s remarks is, of course, mean- 
ingless as an index of mood and we 
should anticipate no difference between 
conditions. As can be seen in Table 
5, this is the case. 

For the angry units, we must, of 
course, anticipate that subjects in the 
Epi Ign condition will be angrier than 
subjects in the Epi Inf condition. This 
is indeed the case. The Anger index 
for the Epi Ign condition is positive 
and large, indicating that these subjects 
have become angry, while in the Epi 
Inf condition the Anger index is 
slightly negative in value indicating 
that these subjects have failed to catch 
the stooge's mood at all. It seems clear 
that providing the subject with an ap- 


TABLE 5 


BEHAVIORAL INDICATIONS OF EMOTIONAL 
STATE IN THE ANGER CONDITIONS 


Condition N Neutral Angen 
Epi Inf 22 +0.07 —0.18 
Epi Ign 23 +0.30 +2.28 
Placebo 228 —0.09 +0.79 
Comparison for anger units p 
Epi Inf vs. Epi Ign «.01 
Epi Ign vs. Placebo «.05 
Placebo vs. Epi Inf ns 


$ For one subject in this condition the sound system 
Went dead and the observer could not, of course, C! 
his reactions, 
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propriate explanation of his bodily 
state greatly reduces his tendency to 
interpret his state in terms of the cog- 
nitions provided by the stooge's angry 
behavior. 

Finally, on this behavioral index, it 
can be seen that subjects in the Epi Ign 
condition are significantly angrier than 
subjects in the Placebo condition. Be- 
haviorally, at least, the injection of epi- 
nephrine appears to have led subjects 
to an angrier state than comparable 
subjects who received placebo shots. 


Conformation of Data to Theoretical 
Expectations 


Now that the basic data of this study 
have been presented, let us examine 
closely the extent to which they con- 
form to theoretical expectations. If our 
hypotheses are correct and if this ex- 
perimental design provided a perfect 
test for these hypotheses, it should be 
anticipated that in the euphoria condi- 
tions the degree of experimentally pro- 
duced euphoria should vary in the 
following fashion: 


Epi Mis = Epi Ign > Epi Inf = Placebo 


And in the anger conditions, anger 
should conform to the following pat- 
tern : 


Epi Ign > Epi Inf = Placebo 


In both sets of conditions, it is the 
case that emotional level in the Epi 
Mis and Epi Ign conditions is con- 
siderably greater than that achieved in 
the corresponding Epi Inf conditions, 
The results for the Placebo condition, 
however, are ambiguous for consist- 
ently the Placebo subjects fall between 
the Epi Ign and the Epi Inf subjects, 
This is a particularly troubling pattern 
for it makes it impossible to evaluate 
unequivocally the effects of the state of 
physiological arousal and indeed raises 
serious questions about our entire theo- 
retical structure. Though the emo- 
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tional level is consistently greater in the 
Epi Mis and Epi Ign conditions than 
in the Placebo condition, this differ- 
ence is significant at acceptable prob- 
ability levels only in the anger con- 
ditions. 

In order to explore the problem fur- 
ther, let us examine the experimental 
factors identified earlier, which might 
have acted to restrain the emotional 
level in the Epi Ign and Epi Mis con- 
ditions. As was pointed out earlier, 
the ideal test of our first two hypotheses 
requires an experimental setup in which 
the subject has flatly no way of evalu- 
ating his state of physiological arousal 
other than by means of the experimen- 
tally provided cognitions. Had it been 
possible to physiologically produce a 
state of sympathetic activation by means 
other than injection, one could have 
approached this experimental ideal 
more closely than in the present setup. 
As it stands, however, there is always 
a reasonable alternative cognition avail- 
able to the aroused subject—he feels 
the way he does because of the injec- 
tion, To the extent that the subject 
seizes on such an explanation of his 
bodily state, we should expect that he 
will be uninfluenced by the stooge. 
Evidence presented in Table 6 for the 
anger condition and in Table 7 for the 
euphoria conditions indicates that this 
is, indeed, the case. 

As mentioned earlier, some of the 
Epi Ign and Epi Mis subjects in their 
answers to the open-end questions 
clearly attributed their physical state 
to the injection, e.g., "the shot gave 
me the shivers.” In Tables 6 and 7 
such subjects are labeled "'Selt-in- 
formed.” In Table 6 it can be seen 
that the self-informed subjects are con- 
siderably less angry than are the re- 
maining subjects; indeed, they are not 
angry at all. With these self-informed 
subjects eliminated the difference be- 
tween the Epi Ign and the Placebo 
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TABLE 6 


ICTS OF ATTRIBUTING BopiLy STATE 
TO THE INJECTION ON ANGER IN THE 
ANGER Ert IGN CONDITION 


z ^] 
Condition | | WEST a 

| 

|__| 
Self-informed subjects 3 —1.67 | ns 
ome | 20 Fass | ns 
Self-informed vs. 05 
Others 


conditions is significant at the .01 level 
of significance. 

Precisely the same pattern is evident 
in Table 7 for the euphoria conditions. 
In both the Epi Mis and the Epi Ign 
conditions, the self-informed subjects 
have considerably lower activity indices 
than do the remaining subjects. Elim- 
inating self-informed subjects, compari- 
son of both of these conditions with the 
Placebo condition yields a difference 
significant at the .03 level of signifi- 
cance. It should be noted, too, that the 
self-informed subjects have much the 
same score on the activity index as do 
the experimental Epi Inf subjects 
(Table 3). 

It would appear, then, that the ex- 
perimental procedure of injecting the 
subjects, by providing an alternative 
cognition, has, to some extent, obscured 
the effects of epinephrine. When ac- 
count is taken of this artifact, the evi- 
dence is good that the state of physi- 
ological arousal is à necessary com- 
ponent of an emotional experience for 
when self-informed subjects Are re- 
moved, epinephrine subjects give con- 
sistent indications of greater emotion- 
ality than do placebo subjects. 

Let us examine next the fact that 
consistently the emotional level, both 
reported and behavioral, in Placebo 
conditions is greater than that in the 
Epi Inf conditions. Theoretically, of 
course, it should be expected that ae 
two conditions will be equally low, for 
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by assuming that emotional state is a 
joint function of a state of physiologi- 
cal arousal and of the appropriateness 
of a cognition we are, in effect, as- 
suming a multiplicative function, so 
that if either component is at zero, emo- 
tional level is at zero. As noted earlier 
this expectation should hold if we can 
be sure that there is no sympathetic 
activation in the Placebo conditions. 
This assumption, of course, is com- 
pletely unrealistic for the injection of 
Placebo does not prevent sympathetic 
activation. The experimental situa- 
lions were fairly dramatic and cer- 
tainly some of the placebo subjects gave 
indications of physiological arousal. If 
our general line of reasoning is correct, 
it should be anticipated that the emo- 
tional level of subjects who give indi- 
cations of Sympathetic activity will be 
greater than that of subjects who do 
not. The relevant evidence is pre- 
sented in Tables 8 and 9, 

As an index of Sympathetic activa- 
tion we shall use the most direct and 
unequivocal measure available—change 


TABLE 7 


ICTS OF ATTRIBUTING BopiLY STATE 
TO THE [INJECTION ON EUPHORIA IN 

THE EUPHORIA EPI IGN AND 
Ert Mts CONDITIONS 


THE 


Epi Ign 


Activity 
N | Index | ? 


Self-informed subjects 8 


: 11.63 | ns 
ers 17 21.14 "T 
Self-informed vs, 05 


thers 


Epi Mis 


N Activity | p 


Index 
Self-informed subjects 5 12.40 | zs 
Others 20 | 25.10 | "s 
Self-informed vs. 10 


thers 
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TABLE 8 


IC ACTIVATION AND EUPHORIA 
PHORIA PLACEBO CONDITION 


SYMPATHE! 
IN THE 


| Acti 
ies d r ctivity 
Subjects whose: | N | index | ^ 
ated ae — » 
Pulse decreased 14 | ns 
Pulse increased 12 ns 


or remained same | 
Pulse decrease vs. pulse | 02 
increase or same | 


in pulse rate. It can be seen in Table 
l that the predominant pattern in the 
Placebo condition is a decrease in pulse 
rate. We shall assume, therefore, that 
those subjects whose pulse increases or 
remains the same give indications of 
sympathetic activity while those sub- 
jects whose pulse decreases do not. In 
Table 8, for the euphoria condition, it 
is immediately clear that subjects who 
give indications of sympathetic activity 
are considerably more euphoric than 
are subjects who show no sympathetic 
activity. This relationship is, of course, 
confounded by the fact that euphoric 
subjects are considerably more active 
than noneuphoric subjects—a factor 
which independent of mood could ele- 
vate pulse rate. However, no such fac- 
tor operates in the anger condition 
where angry subjects are neither more 
active nor talkative than calm subjects. 
It can be seen in Table 9 that Placebo 
subjects who show signs of sympathetic 


TABLE 9 


SYMPATHETIC ACTIVATION AND ANGER IN 
ANGER PLACEBO CONDITION 


‘ " va | Anger 

Subjects whose: N tudes 
Pulse decreased 13 | +0.15 | ns 
Pulse increased 8 | +1.69 | ns 


or remained same 
Pusle decrease vs. pulse 
increase or same .01 


^N reduced by two cases owing to failure of sound 
System in one case and experimenter's failure to take 
pulse in another. 


activation give indications of consider- 
ably more anger than do subjects who 
show no such signs. Conforming to 
expectations, sympathetic activation ac- 
companies an increase in emotional 
level. 

It should be noted, too, that the emo- 
tional levels of subjects showing no 
signs of sympathetic activity are quite 
comparable to the emotional level of 
subjects in the parallel Epi Inf condi- 
tions (see Tables 3 and 5). The 
similarity of these sets of scores and 
their uniformly low level of indicated 
emotionality would certainly make it 
appear that both factors are essential 
to an emotional state. When either the 
level of sympathetic arousal is low or 
a completely appropriate cognition i 
available, the level of emotionality is 
low. 


a 


DISCUSSION 


Let us summarize the major findings 
of this experiment and examine the ex- 
tent to which they support the proposi- 
tions offered in the introduction of this 
paper. It has been suggested, first, that 
given a state of physiological arousal 
for which an individual has no explana- 
tion, he will label this state in terms of 
the cognitions available to him. This 
implies, of course, that by manipulating 
the cognitions of an individual in such 
a state we can manipulate his feelings 
in diverse directions. Experimental re- 
sults support this proposition for fol- 
lowing the injection of epinephrine, 
those subjects who had no explanation 
for the bodily state thus produced, gave 
behavioral and self-report indications 
that they had been readily manipulable 
into the disparate feeling states of 
euphoria and anger. 

From this first proposition, it must 
follow that given a state of physiologi- 
cal arousal for which the individual has 
a completely satisfactory explanation, 
he will not label this state in terms of 
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the alternative cognitions available. 
Experimental evidence strongly sup- 
ports this expectation. In those con- 
ditions in which subjects were injected 
with epinephrine and told precisely 
what they would feel and why, they 
proved relatively immune to any effects 
of the manipulated cognitions. In the 
anger condition, such subjects did not 
report or show anger; in the euphoria 
condition, such subjects reported them- 
selves as far less happy than subjects 
with an identical bodily state but no 
adequate knowledge of why they felt 
the way they did. 

Finally, it has been suggested that 
given constant cognitive circumstances, 
an individual will react emotionally 
only to the extent that he experiences 
à state of physiological arousal. With- 
out taking account of experimental arti- 
facts, the evidence in support of this 
proposition is consistent but tentative. 
When the effects of “self-informing” 
tendencies in epinephrine subjects and 
of “self-arousing” tendencies in pla- 
cebo subjects are partialed out, the 
evidence strongly supports the proposi- 
tion. 

The pattern of data, then, falls neatly 
in line with theoretical expectations, 
However, the fact that we were forced, 
to some extent, to rely on internal 
analyses in order to partial out the 
effects of experimental artifacts inevita- 
bly makes our conclusions somewhat 
tentative. In order to further test these 
propositions on the interaction of cog- 
nitive and physiological determinants 
of emotional state, a series of additional 
experiments, published elsewhere, was 
designed to rule out or overcome the 
operation of these artifacts. In the first 
of these, Schachter and Wheeler 
(1962) extended the range of manipu- 

lated sympathetic activation by em- 
loying three experimental groups— 
3 inephrine, placebo, and a group in- 
jected with the sympatholytic agent, 


chlorpromazine. Laughter at a slap- 
stick movie was the dependent variable 
and the evidence is good that amuse- 
ment is a direct function of manipu- 
lated sympathetic activation. 

In order to make the epinephrine- 
placebo comparison under conditions 
which would rule out the operation of 
any self-informing tendency, two ex- 
periments were conducted on rats. In 
one of these Singer (1961) demon- 
strated that under fear inducing condi- 
tions, manipulated by the simultaneous 
presentation of a loud bell, a buzzer, 
and a bright flashing light, rats in- 
jected with epinephrine were consider- 
ably more frightened than rats injected 
with a placebo. Epinephrine-injected 
rats defecated, urinated, and trembled 
more than did placebo-injected rats. In 
nonfear control conditions, there were 
no differences between epinephrine and 
placebo groups, neither group giving 
any indication of fear. In another 
study, Latané and Schachter (1962) 
demonstrated that rats injected with 
epinephrine were notably more capable 
of avoidance learning than were rats 
injected with a placebo. Using a modi- 
fied Miller-Mowrer shuttlebox, these 
investigators found that during an ex- 
perimental period involving 200 massed 
trials, 15 rats injected with epinephrine 
avoided shock an average of 101.2 trials 
while 15 placebo-injected rats averaged 
only 37.3 avoidances. 

Taken together, this body of studies 
does give strong Support to the proposi- 
tions which generated these experi- 
mental tests. Given a state of sympa- 
thetic activation, for which no imme- 
diately appropriate explanation is 
available, human subjects can be readily 
manipulated into states of euphoria, 
anger, and amusement. Varying the 
Intensity of sympathetic activation 
Serves to vary the intensity of a vari- 


ety of emotional states in both rats and 
human subjects, 


——— 


DETERMI) 


Let us examine the implications of 
these findings and of this line of 
thought for problems in the general 
area of the physiology of the emotions. 
We have noted in the introduction that 
the numerous studies on physiological 
differentiators of emotional states have, 
viewed en masse, yielded quite incon- 
clusive results. Most, though not all, 
of these studies have indicated no dif- 
ferences among the various emotional 
states. Since as human beings, rather 
than as scientists, we have no difficulty 
identifying, labeling, and distinguishing 
among our feelings, the results of these 
studies have long seemed rather puz- 
zling and paradoxical. Perhaps because 
of this, there has been a persistent 
tendency to discount such results as 
due to ignorance or methodological in- 
adequacy and to pay far more atten- 
tion to the very few studies which 
demonstrate some sort of physiological 
differences among emotional states than 
to the very many studies which indi- 
cate no differences at all. It is con- 
ceivable, however, that these results 
should be taken at face value and that 
emotional states may, indeed, be gener- 
ally characterized by a high level of 
sympathetic activation with few if any 
physiological distinguishers among the 
many emotional states. If this is cor- 
rect, the findings of the present study 
may help to resolve the problem. Ob- 
viously this study does not rule out the 
possibility of physiological differences 
among the emotional states. It is the 
case, however, that given precisely the 
same state of epinephrine-induced sym- 
pathetic activation, we have, by means 
of cognitive manipulations, been able 
to produce in our subjects the very 
disparate states of euphoria and anger. 
It may indeed be the case that cognitive 
factors are major determiners of the 
emotional labels we apply to a common 
state of sympathetic arousal. 

Let us ask next whether our results 
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are specific to the state of sympathetic 
activation or if they are generalizable 
to other states of physiological arousal. 
It is clear that from our experiments 
proper, it is impossible to answer the 
question for our studies have been con- 
cerned largely with the effects of an 
epinephrine created state of sympa- 
thetic arousal. We would suggest, 
however, that our conclusions are gen- 
eralizable to almost any pronounced in- 
ternal state for which no appropriate 
explanation is available. This sugges- 
tion receives some support from the ex- 
periences of Nowlis and Nowlis (1956) 
in their program of research on the 
effects of drugs on mood. In their 
work the Nowlises typically administer 
a drug to groups of four subjects who 
are physically in one another’s presence 
and free to interact. The Nowlises de- 
scribe some of their results with these 
groups as follows: 


At first we used the same drug for all 
4 men, In those sessions seconal, when com- 
pared with placebo, increased the checking of 
such words as expansive, forceful, coura- 
geous, daring, elated, and impulsive. In our 
first statistical analysis we were confronted 
with the stubborn fact that when the same 
drug is given to all 4 men in a group, the N 
that has to be entered into the analysis is 1, 
not 4. This increases the cost of an already 
expensive experiment by a considerable fac- 
tor, but it cannot be denied that the effects of 
these drugs may be and often are quite con- 
tagious. Our first attempted solution was to 
run tests on groups in which each man had 
a different drug during the same session, such 
as 1 on seconal, 1 on benzedrine, 1 on 
dramamine, and 1 on placebo. What does 
seconal do? Cooped up with, say, the ego- 
tistical benzedrine partner, the withdrawn, 
indifferent dramimine partner, and the 
slightly bored lactose man, the seconal sub- 
ject reports that he is distractible, dizzy, 
drifting, glum, defiant, languid, sluggish, dis- 
couraged, dull, gloomy, lazy, and slow! This 
is not the report of mood that we got when 
all 4 men were on seconal, It thus appears 
that the moods of the partners do definitely 
influence the effect of seconal (p. 350). 


It is not completely clear from this 
description whether this “contagion” 
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of mood is more marked in drug than 
in placebo groups, but should this be 
the case, these results would certainly 
support the suggestion that our findings 
are generalizable to internal states 
other than that produced by an injec- 
tion of epinephrine. 

Finally, let us consider the implica- 
tions of our formulation and data for 
alternative conceptualizations of emo- 
tion. Perhaps the most popular cur- 
rent conception of emotion is in terms 
of “activation theory” in the sense em- 
ployed by Lindsley (1951) and Wood- 
worth and Schlosberg (1958). As we 
understand this theory, it suggests that 
emotional states should be considered 
as at one end of a continuum of activa- 
tion which is defined in terms of degree 
of autonomic arousal and of electro- 
encephalographic measures of activa- 
tion. The results of the experiment de- 
scribed in this paper do, of course, 
suggest that such a formulation is not 
completely adequate. It is possible to 
have very high degrees of activation 
without a subject either appearing to be 
or describing himself as “emotional.” 
Cognitive factors appear to be indis- 
pensable elements in any formulation 
of emotion. 


SUMMARY 


It is suggested that emotional states 
may be considered a function of a state 
of physiological arousal and of a cogni- 
tion appropriate to this state of arousal. 
From this follows these propositions : 

1. Given a state of physiological 
arousal for which an individual has no 
immediate explanation, he will label 
this state and describe his feelings. in 
terms of the cognitions available to him. 
To the extent that cognitive factors are 
potent determiners of emotional states, 
it should be anticipated that precisely 
the same state of physiological arousal 
could be labeled “joy” or "fury" or 
“jealousy” or any of a great diversity 


of emotional labels depending on the 
cognitive aspects of the situation. 

2. Given a state of physiological 
arousal for which an individual has a 
completely appropriate explanation, no 
evaluative needs will arise and the in- 
dividual is unlikely to label his feelings 
in terms of the alternative cognitions 
available. 

3. Given the same cognitive circum- 
stances, the individual will react emo- 
tionally or describe his feelings as emo- 
tions only to the extent that he experi- 
ences a state of physiological arousal. 

An experiment is described which, 
together with the results of other 
studies, supports these propositions. 
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ON THE QUANTIFICATION OF MILLER’S 
CONFLICT THEORY? 


NORMAN H. ANDERSON 
University of California, Los Angeles 


This paper elucidates a mathemat- 
ical model of potential usefulness in 
N. E. Miller's theory of conflict. 
Miller has discussed his theory most 
extensively in his provocative 1959 
article and the present model is based 
fairly directly on that discussion. 
For simplicity of exposition, the model 
will be discussed in a rather restricted 
context. Only two particular experi- 
mental designs for a specific experi- 
mental task involving approach-avoid- 
ance conflict will be considered. 
However, the basic ideas and the tech- 
niques used have a considerably 
greater range of applicability. 

In the experimental task to be con- 
sidered, the human subject taps a key 
once upon presentation of a signal 
which is set to repeat at timed in- 
tervals. A goal, which may be rep- 
resented by illuminating one of a row 
of lamps, moves one step closer to the 
subject at each tap of the key. When 
the last tap is made, the goal moves 
the last step, and the subject receives 
reward and punishment (e.g., coin and 
shock), either simultaneously or, in 
certain cases, on different trials. The 
dependent variable is response Speed 
(reciprocal latency) as a function of 
distance from the goal. Each trial 
thus yields essentially a complete 

as the datum. 
Piscium for an overt retreat re- 
sponse could also be made and ke 
possibly be useful in studying will he 
point vacillation. However, it will be 


1A preliminary version of this pape wan 
presented at the first annual meeting of the 
Psychonomic Society, September 1960, in 


Chicago. 


assumed that only an approach or 
nonapproach response is permitted 
and, moreover, that the reward and 
punishment conditions are such that 
the subject reaches the goal on each 
trial. This latter restriction simplifies 
the presentation by avoiding, in par- 
ticular, the more detailed statistical 
discussion needed when some response 
curves are incomplete. 

The task described breaks up the 
goal response into a sequence of sub- 
responses which has a number of ad- 
vantages. The successive points on 
the response curve are more nearly 
physiologically independent than if a 
free continuous response were used. 
Since the goal moves, rather than the 
subject, the motor aspect of the suc- 
cessive subresponses should remain 
the same. In addition, various rein- 
forcement schedules may be pro- 
gramed over the last several steps to 
the goal. Finally, the use of discrete 
subresponses provides simultaneous 
information on response probability 
and response strength which should 
be useful theoretically (Estes, 1950; 
Mueller, 1950; Spence, 1954), 


MopkrL 
Assumptions 


The model rests on the following 
assumptions: 

Ai. The approach and avoidance 
gradients exist and are independent. 

A». Response speed is proportiona 
tothe difference between the approach 
and avoidance gradients. Since the 
constant of proportionality represents 
only the choice of units on the meas- 
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urement scales, it may conveniently 
be taken equal to unity. The assump- 
tion may then be written : 


S(x) = F(x) — G(x), 


where x is distance from the goal, 
F(x) is the approach gradient, G(x) is 
the avoidance gradient, and S(x) is re- 
sponse speed. 

As. The approach and avoidance 
gradients can be written in the follow- 
ing form: 


F(x) = af(x) 4- 6 F c 
G(x) = dg(x) te +f, 


where a, b are parameters depending 
on reward ; d, e are parameters depend- 
ing on punishment; c, f are constants 
in the given experimental situation; 


f(x), g(x) are completely arbitrary 


functions. The functions, f(x) and 
g(x), will be called the shape func- 
tions of the approach and avoid- 
ance gradients, respectively. The 
Six parameters appearing in Ag are 
assumed to vary with individuals and 
with trials. However it will be as- 
sumed here that no such variation 
occurs in the shape functions. This 
latter assumption will be discussed 
further under Application 2. 

Of the three assumptions of the 
model, A, is the most stringent. The 
statement that the approach and avoid- 
ance tendencies exist is the main 
assertion about the psychological proc- 
esses underlying the behavior. The 
statement that the gradients are in- 
dependent is a brief way of saying 
that reward affects only the approach 
gradient and that punishment affects 
only the avoidance gradient. Of 
these two independence propositions, 
the first seems not unreasonable but 
the second should perhaps be viewed 
with greater reserve. Nevertheless, 
Miller does make these assumptions, 
both explicitly in Figures 7 and 9 of 
the cited reference (Miller, 1959), and 


implicitly in his procedure of com- 
paring separately obtained approach 
and avoidance response curves. 
These assumptions will accordingly be 
used here. A somewhat weaker in- 
dependence requirement will be dis- 
cussed below. 

'The proportionality assumption, 
As, is more stringent than that made 
by Miller (1959) who requires only 
that the response measure be some 
monotone function of the difference 
between the gradients. It is, how- 
ever, not entirely out of line with Hull- 
Spence theory (e.g., Logan, 1952; 
Spence, 1956) if the gradients are 
interpreted as reaction potentials. In 
any case, it will be seen below that 
only monotonicity is required if an 
appropriate experimental design is 
used. 

Assumption A; is fairly weak since 
no restriction is placed on f(x) and 
g(x); these functions are entirely 
arbitrary and need not even be esti- 
mated for the proposed tests of the 
model. The given representation of 
the approach gradient assumes that 
the effect of reward is partly multi- 
plicative and partly additive, as 
represented by the parameters a and b, 
respectively, and an analogous state- 
ment holds for punishment. The use 
of six parameters may seem peculiarly 
generous but the first test of the 
model will be completely parameter 
free, and the second test requires 
evaluation of only two parameters. 

Before proceeding to the deriva- 
tions, a few remarks on averaging are 
required since biases may result from 
differences between individuals or 
trials (Anderson, 1959, 1960, p. 85). 
For the present case, consider a given 
set of subjects and trials. Because of 
individual differences, subjects treated 
alike will in general have different 
gradients. Moreover, gradients will 
vary with trials because of learning, 
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random fluctuation, or adaptation 
effects of various kinds. To the given 
set of subjects and trials therefore, 
corresponds a set or distribution of 
gradients of either type. Assumption 
A; implies that experimental condi- 
tions with say, the same punishment 
but different rewards will have the 
same distribution of avoidance gra- 
dients. This conclusion follows from 
A, under the present restriction that 
the subjects reach the goal on each 
trial, thus receiving both reward and 
punishment on each trial. Given this 
restriction, the assumption that re- 
ward and punishment act independ- 
ently implies that the experimental 
conditions provide equivalent punish- 
ment treatment, and so yield identical 
distributions of avoidance gradients. 
In the same way, it is seen that experi- 
mental conditions with the same 
reward but different punishment have 
the same mean approach gradient. 
These two results are used to justify 
the use of average data in the follow- 
ing applications of the model. 


Application 1 


This application is based on a 2 E 
design with two values each of reward 
and punishment. Consider a given 
subject receiving Reward i and Pun- 
ishment j, and let P;(x) be the ap- 
proach gradient, G;(x) the avoidance 
gradient, and .S,;(x) the speed func- 
tion for this subject on a given trial. 
Then by A»: 


Sy) = FG) = GiG) C1] 


This result holds for a given subject 
on a given trial. If a set of subjects 
and trials is considered, there will bea 
corresponding set or distribution of 
gradients. As shown above, the — 
approach gradient, averaged over t a 
distribution, will be the same for the 
two experimental conditions which 


ANDERSON 


have the same reward. Similarly, the 
mean avoidance gradient will depend 
only on the punishment involved, not 
on the reward. Since the mean of the 
difference on the right side of Equa- 
tion 1 is the difference of the means, 
it follows that Equation 1 holds on 
the average over the set of subjects 
and trials under consideration. The 
quantities appearing in Equation 1 
will accordingly be interpreted as 
means. When the four theoretical 
mean speed functions of Equation 1 
are placed in the 2 X 2 table of the 
design, the following array is ob- 
tained: 


Pix) — Gi(x) 
P(x) — Gi(x) 


F(x) — G(x) [2] 
Fy (x) — Go(x). 


Inspection of this array shows that 
the difference between the two entries 
in the first row is equal to the differ- 
ence between the two entries in the 
second row. This means, in analy 
of variance terms, that the interaction 
of the row (reward) and column 
(punishment) variables is zero at each 
value of x. The model thus implies 
that the Reward x Punishment, aud 
the Reward x Punishment X Dis- 
tance interactions are zero in principle, 
and hence nonsignificant in practice. 
Consequently, the ordinary analysis 
of variance may be applied directly 
to the obtained speed curves in order 
to test the model. 

This application of the model re- 
quires only A, and As and is com- 
pletely Parameter free. Analogous 
results hold, of course, for any fac- 
torial design with reward and punish- 
ment as two of the factors. 


Application 2 


Consider now a situation with fixed 
Punishment and £ conditions of ie 
ward. The speed curve on a given 
trial for a given subject under Rewat 
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i may be written: 
Si(x) = aif(x) + bi + c — G(x) 
$-1;23,£ [R] 


If Equation 3 is to be useful, it must 
be averaged over the distribution of 
gradients corresponding to a given set 
of subjects and trials. Now the aver- 
age of the sum of terms on the right 
side of the equation is the sum of the 
average terms. As noted above, the 
mean G(x) is the same in all reward 
conditions, and so also is the mean 
value of c. Since f(x) was taken as 
fixed, the mean of the product, a:f (x), 
is just f(x) times the mean a;. Hence 
Equation 3 is valid if each term is 
interpreted as a mean and this will be 
done. 

A rough test of the model for this 
application may be obtained by using 
Equation 3 to get expressions for, 
say, Ss(x) — Si1(x), and Ss(x) — Si(x). 
These expressions may fairly easily be 
combined to show that the one differ- 
ence is a simple linear function of the 
other difference: 


S3(x) — Si(x) 
= ALS2(v) — Si(x)] + B, [4] 


where 4 and B are constants whose 
exact expressions in terms of the a; 
and 6; are not of importance. The 
values of .1 and B could be estimated 
by using selected values of x, or by 
least squares. An assessment of good- 
ness of fit could then be made by eve 
and this procedure, crude though it is, 
might be useful in many cases. 

A rigorous statistical test for Appli- 
cation 2 may be obtained using the 
multivariate analysis of variance in 
the form of Fisher's (1938) test of 
coplanarity. For suppose Equation 3 
is correct, and that the speed gra- 
dients are measured at, say, p values 
of x, Then it can readily be shown 
that the p X p between conditions 


sums of products matrix has Rank 2 


(g 2 3). The null hypothesis to be 
tested is then, in Rao's (1952) nota- 
tion, that the smallest p — 2 latent 
roots of B — AW are zero, where B 
and IV are the between and within 
conditions sum of products matrices, 
respectively. Rao gives numerical 
examples (pp. 364-375). The tests 
are approximate but evidently quite 
adequate in respect of significance 
level. The results of Marriott (1956) 
might be expected to vield somewhat 
more powerful tests. Unfortunately 
both procedures are somewhat tedious 
with a desk calculator. 

Application 2 requires only half of 
Ay, namely, that reward does not 
affect the avoidance gradient. Also 
only half of A; is used since no restric- 
tion is placed on G(x). The treat- 
ment of the case of fixed reward and 
variable punishment can be done in 
essentially the same way. 

Since Application 2 uses the ap- 
proximate approach gradient repre- 
sentation of As; the results are not 
exact, in contrast to those of the first 
application. It is interesting to note 
that the same results are obtained if 
à; is assumed fixed within each reward 
condition and f(x) is allowed to vary 
over trials and subjects. It would 
still be required that the mean f(x) be 
the same in each condition, but the 
individual subjects could have sharply 
different approach gradient shape 
functions. 

If both a; and f(x) were to be con- 
sidered as variable over subjects and 
trials, each individual approach gra- 
dient would be represented exactly. 
However, if both a; and f(x) are 
variable, the mean of their product is 
the product of their means, plus the 
covariance of the two factors. In 
this case, Equation 3, interpreted in 
terms of means, would be biased be- 
cause this covariance term is not in- 
cluded. Obversely, the covariance 
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term indexes the adequacy of the 
approximation of the gradient repre- 
sentation of A;. The bias in Equation 
3 tends to cancel in the tests of good- 
ness of fit and is not thought to be 
serious. However, it should be noted 
that the bias may be reduced by re- 
ducing the variability between trials 
and between subjects. Trial vari- 
ability may be reduced by restricting 
attention to asymptotic data. The 
subject variability may be reduced by 
considering homogeneous blocks of 
subjects including, in the limit, single 
subjects. 


Comments 


If the model fails the experimental 
tests of the sort illustrated in the 
above two applications, one or more 
of the assumptions must be at fault. 

The proportionality assumption, 
As, although presumably a reasonable 
first approximation, may well not be 
sufficiently exact for the quantitative 
tests under consideration. However, 
if the response measure is not propor- 
tional to the difference between the 
gradients, it should in any case be 
some strictly monotone function of 
this difference. From this it follows, 
by taking inverse functions, that there 
is some monotone function of the 
response measure which is propor- 
tional to the difference between the 
two gradients. Thus, if A» alone is 
at fault, it may in principle be rectified 
by employing this latter monotone 

function of the response measure in 
place of the original response measure. 
The only problem is whether this 
scaling of the response measure may 
be accomplished in practice. For the 
cited applications, the scaling pee 
no particular difficulty and the details 
are given in a subsequent section. : 

It thus follows that, if no m 
the data satisfies the model, one of the 
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other assumptions must be incorrect. 
In this case, since A; is quite weak and 
is used only in Application 2, Ai 
would become suspect. 

Assumption A; might be incorrect 
for two different reasons. First, the 
approach and avoidance gradients 
might give an inadequate representa- 
tion of the processes underlying the 
behavior. Indeed, the present de- 
velopment, in contrast to that of 
Bower (1959) and Spence (1960, Ch. 
22), does not give an explicit account 
of choice point vacillation. Further 
investigation of this aspect of the be- 
havior would probably be quite useful 
as Miller (1959) has observed, and 
would be expected to guide any neces- 
Sary revisions of the two gradient 
representation. 

The second reason for rejection of 
Ai would be nonindependence of the 
approach and avoidance gradients. 
If in fact reward and punishment 
affect both gradients, it will probably 
be necessary to include the learning 
processes explicitly in the model. 
This problem has not been investi- 
gated in any detail since it seems 
Preferable to base the initial tests 
of the model on the simplest as- 
sumptions. 

If the model is supported by the ex- 
perimental tests, one's confidence in 
the assumptions would be increased, 
but it would still be useful to ask how 
much had been learned. Success in 
prediction gives only an indirect con- 
firmation of the premises which 
yielded the prediction. Thus success- 
ful prediction would not prove Ai 
because alternative assumptions might 
imply the same results as, indeed, the 
following example shows. 

Consider Application 2 with reward 
and punishment given on separate 
trials. Suppose that the avoidance 
gradient has the same shape function 
as the approach gradient. Equation 
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3 may then be written: 


Si(x) = (a; — d) f(x) 
oe (De = is)! -F (6 — A 


Suppose also that there is a decrement 
in the avoidance gradient on rewarded 
trials which results simply from the 
absence of punishment and so is 
independent of reward magnitude. 
This decrement would be represented 
by decreases in d; and e; in the 
equation above. Inspection of the 
equation shows that it is impossible to 
distinguish decreases in d; and e; from 
corresponding increases in a; and b;. 
It would thus be impossible to dis- 
tinguish a simultaneous increase in 
F(x) and decrease in G(x) from a 
larger increase in F(x) and no change 
in G(x). Consequently, if it is also 
assumed that any effect of punish- 
ment on the reward gradient is in- 
dependent of amount of reward, then 
the results derived in Application 2 
hold also in the present case. 

The example of the preceding para- 
graph, which seems to be a fairly 
reasonable possibility, indicates that 
the model will hold under a wider 
range of assumptions than those 
actually used in the derivation. This 
extra generality is a mixed blessing 
Since, except for certain practical 
Situations, prediction of behavior is 
only a means to the end of under- 
standing behavior. However, the 
model does discriminate between vari- 
ants of A, which are of interest. 
Moreover, it does not seem likely that 
the model predictions would be veri- 
fied unless the measuring scale for the 
dependent variable were correct. 

The estimation and interpretation 
of the shape functions and of the 
reward and punishment parameters 
also require discussion. The repre- 
sentation of the gradients used in A; 
is quite flexible and should give a good 


fit to any particular set of gradients. 
However, it is a representation of 
convenience, one which dissects the 
gradients in a mathematically power- 
ful but possibly unpsychological way. 
Consequently, the reward and punish- 
ment parameters do not necessarily 
have any direct psychological inter- 
pretation. Whether or not they do 
cannot be determined without further 
empirical or theoretical analysis. 
Since the tests of the model do not 
require the estimation of these param- 
eters, this problem is not considered 
here. Some comments on the inter- 
pretation of the parameters are given 
in a subsequent section. 

Matters are somewhat clearer as 
regards the shape functions. If the 
model survives the test of Application 
2, then Equation 3 may be used to 
estimate the shape function of the 
approach gradient. This shape func- 
tion is only unique up to a multi- 
plicative factor and an additive term 
so that it might perhaps be desirable 
to reduce to a standard form by 
setting f(0) = 1, f(«) — 0. How 
generally useful the resultant estimate 
would be is an open question. As 
Miller (1959) indicates, the gradients 
will be strongly influenced by the 
stimulus properties of the situation. 
There would thus be no reason to 
expect more than a rough similarity 
between shape functions from differ- 
ent experiments unless the procedures 
were carefully standardized. Within 
a given situation, however, estimation 
of the shape functions would be ex- 
pected to be relevant to Miller's hy- 
pothesis that the avoidance gradient 
has a steeper slope than the approach 
gradient. 

Another problem deserving com- 
ment arises because the present ap- 
plications of the model are based on 
comparisons of different experimental 
conditions in which either the ap- 
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proach gradient or the avoidance 
gradient is the same in two or more 
conditions. If the present restriction 
that the subjects reach the goal on 
every trial is not met, the required 
comparability may not obtain. Thus 
subjects receiving a greater punish- 
ment might reach the goal less often 
than subjects receiving a lesser pun- 
ishment. If so, the number of rewards 
would not be the same in the two 
conditions, the approach gradients 
would not be the same, and the above 
applications need not hold. In situa- 
tions of this sort, there may be some 
justification for applying the model to 
the asymptotic data. For, by A, the 
same asymptotic reward gradients 
should be obtained, the lower rate of 
reward in the condition with greater 
punishment affecting only the rate at 
which the asymptote is reached. 

In this connection, it is worth men- 
tioning a potentially useful experi- 
mental procedure. The subject is run 
under a given reward and punishment 
until the behavior has stabilized and 
then told that on the next trial the 
reward will be increased. By Aj, this 
announcement should affect only the 
approach gradient. The avoidance 
gradients would be the same on the 
trial preceding as on the trial following 
the reward so that the data from these 
two trials would be comparable in the 
desired way. 

Finally it should be noted that the 
model may be more generally appli- 
cable than has been indicated. It 
might be possible, depending on the 
experimental setup, to ipe Ww 
two gradients as approach gra rents 
with respect to two incompatible 
goals. In particular, some of red 
sults of Application 1 hold even when 
there is but one step from start to goal. 
These results would be noreneally 
applicable to situations involving two 
competing response tendencies. 
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LEARNING 


Explicit incorporation of the learn- 
ing processes into the model appears 
to present some difficulties. This 
section outlines one possible approach 
which should help set out the problems 
involved. The discussion is not es- 
sential to the rest of the paper and 
only a very brief sketch will be given. 

For illustrative purposes, it is as- 
sumed that reward is given on a ran- 
dom proportion, z, of the trials, 
punishment being given on the re- 
maining 1 — z of the trials. In the 
learning model to be considered, it is 
assumed that each gradient changes 
on each trial by a constant fraction of 
the possible change. This propor- 
tional change assumption is fairly 
common (Anderson, 1961; Bush & 
Mosteller, 1955; Estes, 1950; Hull, 
1943) but it is here being applied to 
the whole gradient. Evidently it is 
necessary to allow the possibility of 
different constant fractions for the 
two gradients and for the two types of 
trials. 

If reward is given on Trial 7, the 
gradients on Trial x + 1 are: 


Fria(x) = F,(x) 

sx) — F(x], L5 
Ganla) = G(x) 

+ O:[G1 (x) — G.(x)]. 
If punishment is given on Trial z, the 
gradients on Trial 1 T 1are: 
PFua(x) = PF) 

+ o'LPi(@) — Fax), D6l 
Grsi(x) = G(x) 

+ 9 [Gu (x) — Ga]. 
Here 01, 05, 0'1, 0's are the four learning 
rates; subscripts » and n + 1 index 
the gradients on Trials n and n + 13 


] Y * r 
subscripts U and L index the pus 
and lower limit gradients, quantitie 


i 
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analogous to the limit points of the 
linear operators of Bush and Mosteller 
(1955). The quantities in brackets 
thus denote the possible change on 
Trial z, and the equations state simply 
that the actual change is a constant 
fraction of the possible change. 

Although the learning model is 
conceptually simple, its complete 
analysis would involve the estimation 
of four learning rates and four limit 
gradients. It seems reasonable to 
Suppose that two of the limit gra- 
dients, F;(x) and Gz(x), are identi- 
cally zero, and this will be done in 
what follows. Further restrictions on 
the parameters seem problematical so 
that the estimation problem would be 
difficult. However, analysis of the 
sequential dependencies, that is, of 
the trial-to-trial increments and de- 
crements in the response measure, 
does yield some information which 
may be useful. 

On the basis of the above formula- 
tion, the speed increment on rewarded 
trials, denoted by Cj (x), can be shown 
to be: 


Cr(x) = ALFu (x) — Pa(x) 
+ 0G, (x). 
On punished trials, the speed decre- 
ment is: 
Cp(x) = 0 F, (x) 
+ 8s [Ge(x) — Gal]: 


The sum of the increment and decre- 
ment is thus: 


Cr(x) + Cr(x) = 0 Fe (x) 
+ Ge (x) + 0i — 01) Fx) 
— (à; — 63G.(x). [7] 


The left side of the equation may be 
obtained from the data. For a given 
set of trials, C(x) would be estimated 
by the speed increment averaged over 
all rewarded trials; similarly Cp(x) 
would be estimated by the speed 


decrement averaged over all punished 
trials. F„(x) and G,(x) would cor- 
respondingly be replaced by their 
theoretical averages over the given 
set of trials. 

Now F,(x) and Ga (x) depend on the 
value of r; however, the other quan- 
tities on the right side of Equation 7 
are independent of m. If reward and 
punishment had equivalent effects so 
that 0; = 61, and 6’) = 62, then the 
last two terms on the right side vanish. 
In this case, the model implies that 
Cr(x) + Cp(x) has the same values 
for different groups run under differ- 
ent values of m. In this case, there- 
fore, the sequential dependency pro- 
vides an exact test of the model. 
Since the last two terms of Equation 7 
involve learning rate differences, the 
sequential dependency should be ap- 
proximately independent of m as long 
as the effects of reward and punish- 
ment are not too disparate. 

If punishment is fairly strong com- 
pared to reward, it would be expected 
that both 0'; > and 0s > 0s. Then, 
since ,(x) is an increasing function 
of z, and Ga (x) is a decreasing function 
of z, the sequential dependency should 
show modest increases as 7 is in- 
creased. Conversely, if punishment 
is fairly weak compared to reward, it 
would be expected that 0^; < 61, and 
0'2 < 0. The sequential dependency 
would then show modest decreases as 
7 is increased. 

Related results may be obtained 
from the higher order sequential de- 
pendencies, and one of these deserves 
particular mention. Let Cpr(x) de- 
note the change in response caused by 
a pair of successive trials, the first of 
which is punished and the second 
rewarded. Similarly, let Crp(x) be 
the speed change caused by a rewarded 
and a punished trial given in that 
order. The differences between these 
quantities is always positive and 
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indeed: 


Crr(x) — Crr(x) 
= 010'sFu(x) + 0.0'sGu(x). I8] 


The difference on the left side of this 
equation is to be estimated from the 
data in the same way as mentioned 
above. The equation asserts that 
this difference is independent: of z. 
This result affords a rigorous non- 
parametric test of goodness of fit. It 
should also be noted that the differ- 
ence of Equation 8 varies directly with 
the learning rates thus yielding a 
qualitative check of the model. 

The above results are less complete 
than would be desired but they do 
permit an assessment of the model. 
Although in principle the sequential 
dependencies provide the information 
for a complete solution to the pa- 
rameter estimation problem (Ander- 
son, 1959), practical difficulties are 
great. Given two types of gradient 
and two types of reinforcing stimuli, 
however, there are necessarily a con- 
siderable number of quantities to be 
estimated, Consequently, it may be 
useful to consider the possibility of a 
representation of the conflict situation 
which is mathematically more tract- 
able even if possibly less realistic, It 
should be noted, however, that the 
trial-to-trial increments and decre- 
ments in response strength are the 
basic quantities in the study of learn- 
ing, and their investigation should 
prove valuable under any circum- 


stances. 
SCALING THE RESPONSE MEASURE 


This section shows how the scaling 
of the response measure mentioned 
above may be accomplished. _ The 
method is general in that it applies to 
any continuous response Med 
However, the scaling E E 
fairly specific to the particular ex- 


perimental design employed so that a 
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general presentation is not possible. 
Consequently, only a brief sketch 
indicating how the method would be 
used in the above two applications will 
be given. Statistical details, such as 
the loss of df in estimating the aux- 
iliary scaling parameters, will not be 
considered. 

The basic idea of the present 
method is to use the theoretical equa- 
tions directly in the scaling procedure. 
This approach may be more generally 
applicable. To the author, it seems 
considerably more attractive than 
Hull's (Hull, Felsinger, Gladstone, & 
Yamaguchi, 1947) quantification pro- 
cedure because the psychophysical- 
Psychometric assumptions on which 
the latter is based are not closely 
related to the form of the theoretical 
behavior laws proposed for investi- 
gation. 

If scaling is possible in principle, 
then, as noted above, there is some 
monotone function, M, of the observed 
Tesponse measure, S(x), which does 
satisfy the proportionality assump- 
tion, A». Expanding om[S(x)] in a 
Maclaurin series yields: 


9n[S(x)] = S(x) + aS? (x) 
+ asS%(x) +--+, [91 


The constant term in the series ex- 
Pansion does not appear since it is 
required that M(0) = 0. Moreover; 
it is no restriction on present purposes 
to take the coefficient, ai, of S(x) to be 
unity and this has been done. The 
number of terms of the series which 
would be required depends on how 
well the original scale approximates 
the final scale; only the first three 
terms of the series will be used here 
since it would seem that that shoul 
be adequate for most work. 

Consider first Application 1 and for 
convenience of notation let: 


P(x) = Su” (x) — Si” (x) = Sa” (s) 
+ S2”(x), n=1,2, 7 
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The assumption that S(x) is propor- 
tional to the difference between the 
gradients is equivalent to the require- 
ment that the Reward X Punishment 
interactions of Equation 2 be zero. 
These interactions will be zero when 
the difference between the entries in 
the first row equals the difference be- 
tween the entries in the second row 
for each value of x. In present nota- 
tion, this condition means concisely 
that Pi(x) = 0 for each x. 

If 9n[.S (x) ], rather than S(x) itself, 
is to be proportional to the difference 
between the gradients, a condition 
analogous to P(x) = 0 must hold. 
If S(x) in Equation 2 is replaced by 
INLS(x)] as represented in the series 
expansion of Equation 9, this condi- 
tion is seen to be: 


Pi(x) + asPo(x) + a3P3(x) = 0. 


Scaling is thus equivalent to choos- 
ing a» and as so that this last equation 
is true for each x. In practice, of 
course, this would not be possible 
because of sampling error. Conse- 
quently, o» and a3 are chosen so as to 
minimize the interaction, using the 
method of least squares (e.g., Lewis, 
1960). 'Thesum of squared deviations 
is ZQLPi(x) + aePo(x) + a3P3(x) F. 

his sum is to be differentiated with 
respect to a» and o; and these deriva- 
tives set equal to zero. The following 
two linear equations result: 


DzP1(x)Po(x) + oZ. Ps (x) 


+ asP2P2(x)P3(x) = 0, 
E.P (x) Pax) + a9ZzPs (x) Ps(x) 
ToosX.Pg (x) = 0. 


Since the values of the P,(x) are 
obtainable from the raw data, these 
equations are easily solved for a and 
a3. With these values of a2 and as, 
the observed responses would be 
scaled using Equation 9 and the 


alysis of variance applied directly 
y 


X 
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to the scaled data. If the interaction 
remains significant in the scaled data, 
then the scaled response measure does 
not satisfy the proportionality as- 
sumption. lf additional terms of the 
series expansion of Equation 9 were 
used, say N + 1 altogether, the scal- 
ing method would yield the following 
N linear equations in V unknowns: 


Z;X,,P,(x)P.(x) = 0, 
t= 2,3, 0, NFL 


The scaling technique for Applica- 


tion 2 follows similar lines. Let: 
Ra(x) = Ss^(x) — AS” (x) 
+ (A —1)Si"@), »21,2,--*. 


From Equation 4, the requirement of 
the model in terms of the original re- 
sponse measure is that R, (x) — B = 0. 
In terms of the scaled measure, this 
requirement becomes: 


Ri (x) + asRo(x) + asR3(x) — B = 0. 


It is thus required to minimize 
E.LRi(x) + a2Re(x) + a3R3(x) — BY. 
This minimization may be accom- 
plished by using least squares to ob- 
tain four equations in four unknowns, 
o», a3, 4, B. The equations, which 
are not given here, are unfortunately 
mildly nonlinear in the unknowns so 
that a process of successive approxi- 
mation would be required for their 
solution. 

Scales obtained from the above 
procedure are subject to a bias since 
the mean of the transformed scores 
will not in general be exactly equal to 
the transform of the mean score. The 
amount of bias would be expected to 
be negligible unless M is considerably 
nonlinear. If WM is badly nonlinear, 
it would probably be advisable to 
iterate the scaling procedure in order 
to converge on an unbiased scale. 

If the M obtained from the scaling 
procedure is nonmonotone, then one 
must of course reject the possibility of 
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a theoretically adequate scaling, and 
hence also the basic theory. How- 
ever, it is apparent that the scaling 
technique may be rather too obliging 
in yielding too readily, in a given ex- 
perimental situation, a transformation 
of the observed response measure 
which satisfies the model. Neverthe- 
less, there is a requirement of gen- 
erality implicit in the notion of a 
measurement scale; it must be appli- 
cable to a somewhat wider class of 
situations than that on which the 
scaling is based. Essentially this 
problem has been discussed by Seward 
(1955) as a requirement of constancy 
of the intervening variable. It suf- 
fices here to note that this requirement 
of generality also means that the scal- 
ing may be done in a convenient 
situation. The resultant scale may 
then be used to test the model in 
situations in which the solution of the 
scaling equations would be difficult or 
impossible. 
In this connection, it should be 
observed that the scaling procedure 
itself may be employed in such a way 
as to give a consistency check. If the 
distance variable is divided into sub- 
ranges, the scaling procedure may be 
applied to the data of each separately. 
Theoretically, the same scale should 
be obtained in each subrange so that 
comparison of the separately obtained 
scales provides a test of the theory, 
As has been seen, the logic of the 
present scaling technique consists in 
using the postulated behavior laws to 
induce a scaling on the dependent 
variable. In principle, this same ap- 
proach could be applied to any a 
which postulates quantitative laws o 
behavior although the detailed pro- 
cedure would depend on the form of 
those laws and on the experimen- 
tal design employed. The resulting 
scales would of course be correct m 
within the context of the postulate: 
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laws. However, they would have the 
considerable merit of being directly 
relevant to those postulates and of 
being directly associated with the 
development of the theory. 

Finally, it should be noted that 
possession of a theoretically adequate 
scale of the dependent variable opens 
up the possibility of scaling the under- 
lying variables. Miller's (1959) sug- 
gestion that a change in drive raises or 
lowers the whole gradient may be 
taken as one example. The difference 
between the scaled response before 
and after the change in drive should 
then represent the effect of that 
change in a theoretically adequate 
way. This would, for instance, give 
a basis for equating drives induced by 
different experimental operations. 


POSSIBLE RELATIONS TO 
HULL-SPENCE THEORY 


The purpose of this section is t? 
explore one possible way in which the 
present formal model might be inter- 
preted within a more general quantita- 
tive behavior theory, Such an in- 
terpretation is not necessary, © 
course, since the model is self-con- 
tained, but there are at least two 
reasons for so proceeding. An articu” 
lation of the model with an existing 
theory would presumably help tO 
define the meaning of the model Pa 
rameters. More importantly, suc! 
articulation would be expected to she 
light on the validity of the assump- 
tions and to indicate possible direc- 
tions of change if change should prov? 
necessary. 

It is appropriate to restrict 2 
discussion to general Hull-Spenc* 
theory. The treatment of this theo? 
May seem somewhat casual but i 
should be realized that the intent i$ t 
indicate that an interpretation of E 
model within the theory may be up 
Sible and useful. 
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The following notation will be used. 
E, H, D, and K refer to reaction 
potential, habit strength, drive, and 
incentive motivation, respectively. 
Superscripts + and — index quanti- 
ties associated with the approach and 
nonapproach responses, respectively. 
Thus the approach gradient is denoted 
by E*(x), and the avoidance gradient 
by E-(x). 

Attention will be restricted to 
the following experimental situation. 
"The subject is trained with a fixed 
reward and punishment until his be- 
havior has stabilized. Theoretically, 
the values of E+ (x), E~ (x), H+ (x), and 
II-(x) will then have reached asymp- 
tote. He is then given one of two 
instructions, either that the reward on 
the next trial will be greater (or less). 
or else that the punishment on the 
next trial will be greater (or less). 
Subscripts 1 and 2 will refer to 
quantities on the trial preceding and 
on the trial following this instruction, 
respectively. 


Case a: Effect of Changed Reward 


When the behavior has stabilized, 
the reaction potential difference cor- 
responding to the difference between 


the gradients may, within Hull's 
theory, be written as: 
E+ (x) — E-(x) 

= K*DIH*(x)— E-(x). [10] 


The effect of the instruction will 
presumably be on K+. Denote the 
new incentive motivation by ak+ 
where a is constant. The reaction 
potential difference after the instruc- 
tion is then: 

E*t(x) — E-(x) 
= eKt+DH* (x) — E-(x). [11] 

To relate this to the model, set 
f(x) = H+ (x), a1 = KtD, a = aK*D, 

=c=0, The Hullian equations 
are thus seen to be consistent with the 
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model, at least for this situation. 
Conversely, this application of Hullian 
theory is seen to justify A as regards 
the approach gradient. 

An analogous treatment in terms of 
Spence's (1956, p. 151) theory is 
more problematical. Apparently, K* 
should be considered as a function of 
x, but Z+ should be taken as inde- 
pendent of x. Although it would 
seem that D should be considered as 
dependent on v, it will be assumed 
here that D is the same at all distances 
from the goal, an assumption which 
will be given some justification below. 
These assumptions yield, analogous to 
Equation 10: 


E*(x) — E- (x) 
= [K + Dt — E-(x). [12] 


The effect of the instruction is again 
presumably to change the incentive 
motivation and it is assumed that the 
new incentive motivation is related to 
the old by the expression, aK+ (x) + 8, 
where a and 8 are constants. It will 
be noted that this assumption includes 
a number of special cases: additive, 
witha = 1; multiplicative, with 8 = 0; 
and exponential growth with 0 Ca €1, 
B-—(1—o)Ky*, where Ky* is the 
maximum value of the incentive 
motivation. This yields, analogous 
to Equation 11: 


E*(x) — E-(x) 
= [eK+(x) + 8 + DyIf* 
— E-(x) = alI*K* (x) 
+ BH* + DH* — E-(x). — [18] 


Setting f(x) = K*(x), a = Ht, 
a = allt, by = 0, b = BH*, and 
c = DII*, it is seen that this inter- 
pretation of Spence's (1956) theory 
gives a result which is consistent with 
the model, at least for the given ex- 
perimental situation. In particular, 
it justifies A; as far as the approach 
gradient is concerned. 
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Case b: Effect of Changed Punishment 


In this case, the problem is whether 
an analysis of E^ can be made which 
is similar to the analysis of £+ given 
in Case a. This does not seem pos- 
sible within either Hull's or Spence's 
theory. For a threat of increased 
punishment would presumably in- 
crease the drive, D, and change the 
drive stimulus, Sp. The increased D 
would, as is well-known, theoretically 
act to increase the reaction potential 
difference, and hence also the speed of 
response, at all distances up to that 
point at which the subject no longer 
approaches. The changed drive stim- 
ulus would presumably have the op- 
posite effect, of course, but unless the 
changes in D and Sp exactly balanced 
out in their effect on the approach 
tendency, the assumption of independ- 
ence of the approach and avoidance 
gradients would be incorrect. It 
would thus appear that Miller's (1959) 
theory is, in this respect, inconsistent 
with the theories of Hull and of 
Spence. 

There is, however, a possible alter- 
native approach. This is to postulate 
a negative incentive motivation, K-, 
which acts on H- in the same way that 
K* acts on Ht, and to assume that the 
effect of the punishment instruction 
is on K~. The theory would then be 
symmetrical with respect to approach 
and avoidance so that the analysis of 
E+ in Case a would carry over directly 
to the analysis of E- and exactly 
analogous conclusions would follow. 

'The concept of negative incentive 
motivation as used here would seem 
to merit some consideration. Such a 
concept has been used by Logan 
(1960, p. 222), and it is consistent 
with Miller's (1959) stimulus inter- 
pretation of drive. Indeed, i K 
were introduced, the concept oft gen- 
eralized drive, D, might be no longer 


necessary. 
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It should also be noted that, even if 
II and I~ were constant at asymp- 
tote, K+ and A~ might well still show 
trial-to-trial variation as a function of 
the sequence of reinforcing stimuli. 
If so, the learning formulation given 
in an earlier section would be directly 
applicable to the asymptotic data. 


Discussion 


Any psychological theory has four 
aspects. First, a symbolic represen- 
tation of the organism or, in less 
nominal terms, a delineation of the 
psychological processes underlying the 
behavior. Second, a set of laws relat- 
ing these processes to their causal 
environmental determinants. Third, 
à set of combination rules which state 
how the underlying processes combine 
or coact to produce the observable 
behavior. Fourth, a set of measure- 
ment scales for the dependent an 
independent variables, and for the 
underlying processes. This fourth 
aspect is, of course, implicit in the 
first three but it will be useful to 
consider it in its own right. , 

The principal concern of the present 
model is with the last two aspects: 
The basic combination rule of the 
model is the assumption, Ag, that the 
Overt response is proportional to the 
difference between the strengths of the 
two underlying processes, namely, the 
approach and avoidance gradients: 
By applying this assumption, under 
the general guidance of Assumptio? 
Au, to various experimental situations: 
Precise theoretical predictions 4€ 
derived and serve as a test of the 
model. An important by-products 
one which often results from a quan 
tative theoretical analysis, is f^ 
illumination of various experiment? 
and statistical considerations of ve 
kind discussed above which are E 
volved in the detailed application i 
the model. This by-product WOY 
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seem the more valuable in that the 
relevance of a number of these con- 
siderations does not appear to be 
limited to the specific conflict model 
employed here. 

It will be noted, however, that the 
exactness of prediction conferred by 
À» is not entirely satisfactory because 
of the arbitrary choice of a measuring 
Scale for the dependent variable. 
Although response speed may be a 
reasonable approximation to a theo- 
retically adequate scale, it would be 
optimistic to believe that it will be 
exact. Without a guarantee of Ag, 
however, it would be difficult to decide 
whether a deviation from prediction 
signals a basic flaw in the theoretical 
Structure, or simply an inappropriate 
choice of scale of measurement. The 
difficulty is resolved by the scaling 
technique which yields a theoretically 
adequate scale of the dependent vari- 
able, i.e., a scale which is correct 
within the model employed. 

- The present development has little 
direct relevance to the first two as- 
pects of theory listed above. The as- 
sumed underlying processes, namely, 
the approach and avoidance tenden- 
cies, are just those postulated by 
Miller (1959). Moreover, the scope 
of the model is narrow and formal in 
contrast to the substantive investiga- 
tions of Miller and his colleagues 
which have been largely directed at 
the establishment of a set of causal 
laws, represented in particular by the 
assumption of gradient independence. 
Nevertheless, the model does bear 
indirectly on these two aspects. For, 
as has been seen, the success or failure 
of the model rests finally on Assump- 
tion A, If the model is rejected, 
then the assumption of gradient in- 
dependence must be abandoned, or a 
different representation of the organ- 
ism must be sought. In addition, it 
Would seem that a theoretically ade- 
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quate scale of the dependent variable 
is useful in defining the underlying 
processes and prerequisite to estab- 
lishing scales for them. 

Although much of the glamor of 
mathematical models seems to be 
associated with their predictive pre- 
cision, this is not a good in itself. 
Prediction is usually only a means to 
the end of understanding behavior. 
Moreover, when there are several 
underlying processes, even a quantita- 
tive attack may yield only qualitative 
results. The advantage of a mathe- 
matical over a verbal formulation is 
that the former brings the basic ideas 
of the theory more quickly and more 
definitely to test. It is in this light 
that the present work should be 
viewed. 

SUMMARY 


This paper presents a mathematical 
model based on N. E. Miller's (1959) 
conflict theory and requiring minimal 
assumptions. An experimental situa- 
tion is described and exact predictions 
are derived for it. A technique for 
obtaining a theoretically correct meas- 
urement scale for the dependent vari- 
able is also given. Finally, possible 
relations of the model to Hull-Spence 
theory are discussed. 
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The dialogue between stimulus-re- 
sponse theories and cognitive theories 
has antecedents going back at least as 
far as the beginnings of scientific psy- 
chology. Protestations that the conflict 
is imaginary or merely semantic are un- 
likely to eliminate fundamental theoret- 
ical differences. To argue that we are 
dealing with the same data regardless 
of theoretical predilections (e.g., Kend- 
ler, 1952) would sound strangely in- 
congruous in the context of other sci- 
ences. The question "What is the 
center of the universe?" was not a theo- 
retical blind alley, and neither is the 
question “What is learned?" as Camp- 
bell (1954) and Rozeboom (1958) 
have shown rather convincingly. In 
the present paper we shall examine 
briefly the major differences between 
the two positions with regard to the 
definition of response or output and 
then proceed to illustrate a possible lo- 
cus of transition from associative to 
cognitive or structural processes. 

The distinction between association- 
ist and cognitive views may be de- 
Scribed as a difference between a topo- 
graphic and a categorical response 
definition. While the associations of 


1This paper was prepared in connection 
with Research Grant M-4852 from the Na- 
tional Institute of Mental Health, United 
States Public Health Service. The section 
on overlearning and transfer was presented 
at the American Psychological Association 
Symposium on Transfer of Training in Sep- 
tember 1957. The initial version of the paper 
was prepared while the author was a Fellow 
at the Center for Advanced Study in the Be- 
havioral Sciences, Stanford, California. He 
is indebted to William Kessen, who contrib- 
uted not only valuable critical goads but also 
additions of substance, and to Jean Mandler 
for her many suggestions. 


the associationistic schools are usually 
considered to be between discrete 
events in the environment (the stimuli) 
and topographically well delimited and 
discrete events associated with the or- 
ganism (the responses), the cognitive 
theorist is more likely to talk about for- 
mal categories of behavior, giving rela- 
tively less attention to the topographic 
aspects of the behavior classed within 
those categories. The important ques- 
tions are: Do organisms learn general- 
izable, but discrete, responses in specific 
situations or are rules of behavior, 
maps, or schemata laid down which 
connect various behaviors and environ- 
mental inputs? Do organisms learn 
what to “do,” or do they learn “what 
leads to what"? While the distance be- 
tween the two views has been reduced 
somewhat by suggestions such as Lo- 
gan's (1960), who aggregates re- 
sponses as a function of differential re- 
inforcement, the problem of response 
definition still appears to be basic. 

The so-called cognitive theories be- 
long to a much larger class of theories 
which have claimed to be "structural," 
including not only the gestalt school 
and its heirs but also the speculations of 
Piaget, Bartlett, and Hebb, among 
others. What these positions have in 
common is a postulation of the organi- 
zation of behavior not derivable from 
any combination or association of stim- 
ulus-response links. 

Rather than reserving the construct 
of “structure” for this kind of theory, 
we would prefer a more general state- 
ment of the structural position. Stric- 
tures are temporal and probabilistic 
linkages of inputs and behavior which 
are available in functional units. These 
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units—broadly conceived—may be as 
simple as a reflex arc, as associationistic 
as a habit-family hierarchy, or as cog- 
nitive as a means-end relationship. The 
output of a structure may be topograph- 
ically limited or categorically defined ; 
the question to be addressed concerns 
the conditions under which structures 
are developed and the relation between 
structures which mediate topographi- 
cally limited behavior and those which 
mediate behavior described in categori- 
cal form. 
`The major notion to be presented in 
this paper and applied to a limited set 
of phenomena is the Proposition that an 
associationist and a cognitive view of 
behavior may work side by side—that 
they are compatible. More specifically, 
it will be maintained here that there is 
now enough evidence available to sug- 
gest that, in some limited Cases, cogni- 
tive characteristics of the organism may 


be developed out of associationist 
processes, 
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The broad area of evidence to which 
we will appeal in support of such a 
proposition covers the warm-up effect 
and the development of learning sets. 
More specific evidence will þe de- 
veloped out of a series of studies, 
strewn through the literature, on the ef- 
fects of overlearning on transfer of 
training. , 
We can briefly make some distinc- 
tions between three types of effects: l 
The warm-up effect. Prior experi- 
ence in the particular setting of an ex- 
periment results in an increasing facili- 
tation on the criterion task as a func- 
tion of amount of such experience 
(Mandler, 1956; Thune, 1950). Thus 
Irion (1948) described the T Bp 
effect in general; while Thune ( 5 ) 
demonstrated that both color-guessing 


and paired-associate learning were 
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equally effective in facilitating learning 
of a test list. Hamilton (1950) sug- 
gested that these effects may be rela- 
tively transitory in nature. 4 

Learning to learn. When a subject 
is given repeated and related tasks, 
there is an increasing facility in the 
performance of these tasks. While this 
effect has been most dramatically dem- 
oustrated by Harlow (1949), Bair 
(1902) in a classic study was able to 
conclude that "learning to do one thing 
gives us the capacity for learning to do 
other things" (p. 45). Most recently, 
Duncan (1960) showed that perform- 
ance improved markedly as a function 
of number of prior tasks. 3 

Overlearning and transfer. High 
degrees of learning of a particular re- 
sponse lead to positive transfer to 2 
incompatible response, even though 
lower levels of learning produce nega- 
tive transfer (cf. Reid, 1953). 

These three general effects all have 
one characteristic in common, practice 
on one task may have a facilitating ef- 
fect on another task even though the 
behavior required in the two tasks 15 
topographically different, and may even 
be incompatible, ; 

To what extent can current associ- 
ationist Stimulus-response theories ac- 
commodate to these phenomena? "s 
far as the warm-up effect is concerne i 
it seems to be reasonable to assume tha 
within broad limits a warm-up task 
permits the subject to "assume mo 
rapidly the postural and ‘attentive’ ad- 
justments necessary for optimal pet 
formance” (Thune, 1950, p. 438). 

Owever, when we deal with learnt 
sets and overlearning, a topographica A 
limited stimulus-response theory wo 
times fails to do justice to the P. 
nomena, ted 

The general position to be adop à 
here in respect to learning sets P 
overlearning is similar to that pea 
originally by Harlow (1949) and € 
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panded subsequently (1959).? Harlow 
Suggested that the formation of hy- 
potheses and insight is a function of the 
continuous development of learning 
Sets, rather than the discontinuous ap- 
pearance and replacement of cognitive 
Structures. The writer has previously 
suggested (Mandler, 1954a) that dur- 
ing overlearning, i.e., continuous error- 
less performance, there are developed 
Symbolic analogues—we now prefer to 
call them analogic structures—of the 
Overt responses. The process unfolds 
in the following manner: First, the or- 
ganism makes a series of discrete re- 
sponses, often interrupted by incorrect 
ones. However, once errors are 
dropped out and the sequence of be- 
havior becomes relatively stable—as in 
running a maze, speaking a word, re- 
producing a visual pattern—the various 
components of the total behavior re- 
quired in the situation are "integrated." 
Integration refers to the fact that pre- 
viously discrete parts of a sequence 
come to behave functionally as a unit; 
the whole sequence is elicited as a unit 
and behaves as a single component re- 
sponse has in the past; any part of it 
elicits the whole sequence. This proc- 
ess of integration has been called “re- 
sponse learning" (Mandler, 1954a) 
and may be applied to any consistent 
responding of whatever topography and 
complexity. Thus, pressing a button is 
integrated as much as running a maze 
or speaking a sentence. Once a re- 
sponse sequence has been integrated 
and acts as a unit, it develops a struc- 
tural representation, a “central” ana- 
logue of this new response unit which 
can function independently of the overt 
response sequence. 

The notion of integration deals in 


2The point of view presented here has 
familial tps with Hebb's (1949) theory, with 
some of Piaget's (1952) ideas, and with a re- 
Cent essay by Miller, Galanter, and Pribram 
(1960), 
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part with the problem of defining units 
of behavior. When complex behavior 
is built up from previously disparate 
units, much trial and error behavior 
may characterize the slow development 
of the new, larger unit. The fact that 
the components may themselves have 
been overlearned—and structurally rep- 
resented—lends a peculiarly “cogni- 
tive” flavor to the explicit or implicit 
hunt and peck search for the appropri- 
ate components. However, once the 
units have been selected and the new 
sequence has been integrated, it tends 
to be elicited as a whole and has a 
quasi-automatic appearance in the way 
it is evoked by the appropriate situ- 
ational cues. It is this development 
from unsure steps to the well integrated 
sequence that characterizes learning to 
drive a car, to type a word, or to run a 
maze. Such a process is related to 
Miller's (1956) discussion of the 
chunking of behavior and probably led 
to Tolman’s (1933) conclusion that 
"the Sign-gestalt formula is appropri- 
ate only for the early stages of learning 
and not for the later ones" (p. 254). 
Analogic structures permit covert 
trial and error behavior, i.e., cognitive 
manipulation of previously established 
behavior. In this sense, the analogic 
representation of a prior behavior se- 
quence is one possible “hypothesis” to 
be applied to a particular situation. 
Given many such structures which are 
relevant to a situational input, the sev- 
eral structures will occur seriatim and 
covertly until an appropriate one is ex- 
pressed behaviorally. If it is incorrect, 
à new structure may occur, thus giving 
the appearance of an entirely discontin- 
uous process, of shifting from hypothe- 
sis to hypothesis. We say “appear- 
ance" of discontinuity because a contin- 
uous process prior to the appearance 
of cognitive structures in fact gives rise 
to them. Structures are developed on 
the basis of associationist stimulus-re- 
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sponse relationships but, once estab- 
lished, enable the organism to behave 
“cognitively.” In Harlow's (1959) 
words: “. . . it is the amount of intra- 
problem learning which determines the 
interproblem transfer” (p. 502). 

In the learning set studies, the prob- 
lem is not one of covert trial and error, 
ie. the choice between two incompat- 
ible or antagonistic responses, but 
rather one of positive transfer from one 
problem to the other when the tasks 
have formal similarities, but no overlap 
in content. What will be suggested 
here—and this is one of the reasons 
why we should move from specific sym- 
bolic analogues to more general struc- 
tures—is that in any one task, e.g., in 
discrimination learning, structures (or 
learning sets) are developed which are 
formally identical with those of the 
succeeding and preceding tasks. Thus, 
for example, the organism in choosing 
the “odd” member of a set of three ob. 
jects engages at first in overt behavior 
which has the following characteristic : 
Take three objects, A, B, and C; the 
organism physically examines all three 
objects in some order, say A, B, C (i.e., 
he makes differentiating responses to 
them—Mandler, 1955—as he actually 
manipulates them) ; whenever the dif- 
ferentiating response to A is similar to 
the differentiating response to B (they 
are "alike") then a consummatory re- 
sponse will follow the choice of C (C 
is the odd object and its choice is re- 
inforced) ; if the differentiating re- 
sponse to D is different from that made 
to A, then B will be followed by à con- 
sunimatory response ; if the differenti- 
ating response to A is followed by a 

consummatory response, no further he- 
havior is elicited. In all cases the con- 
summatory response is the stop rule. 
At this point we are still dealing with 
overt behavior during the early stages 
of learning. However, as the S RUCE 
of differentiating and overt responses 1s 
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integrated, a structure of that sequence 
will be developed. This structure has 
three nodal points each of which repre- 
sents a differentiating response and a 
choice response, none of which needs to 
be made overtly and the sequence can 
now run off in the following manner: 
Look at one object and make a differ- 
entiating response, look at the second 
object and, if the same differentiating 
response occurs as the one made to the 
first, then make an overt choice of the 
third object. Note that the matching 
requirement need not be held for more 
than two objects. Assume that A, P. 
and C have the characteristics k, k, and 
l, respectively, C therefore is the odd 
object. If the subject first examines 
and then B, then the object following 9 
match is the correct one ; if he examines 
them in the order A, C, and B then à 
repetition of the sequence will lead to 
the same occurrence of a choice fol- 
lowing a match (he has examined A, k; 
C, 1; B, k; A, k; and now chooses C) ; 
if the order is C, A, and B, a return to 
C will lead to a similar result, In other 
words, the only required structures are 
an orderly sequence of covert choices 
and an overt choice or act following 
two similar differentiating responses 
What the analogic structure permit? 
the organism to do is to eliminate the 
Overt examination of all the objects !? 
order to arrive at the consummato 
response. 

This exercise has been used to dem 
onstrate one way in which cognitive 
categorical behavior may be slow , 
built up out of regular overt behave 
Sequences. One of the major variab x 
of associative theories—frequency—?! 
Pears to be an important antecedent 
simple Structures, aCe 

The development of analogic st 

dev 
ni 


2 The similarity between the proces 
scribed here and the simulation of CO8" 
Processes with computers (e.g. News 
Simon, 1959) is obvious, 
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tures, and the resulting cognitive be- 
havior is particularly relevant to verbal 
behavior which, in the natural setting, 
demonstrates high degrees of learning 
as well as cognitive correlates. And 
finally, while the apparent cognitive 
sequelae of overlearning have been 
demonstrated in mammals (monkeys, 
men, and rats), studies by Warren and 
his associates (Warren, 1960; Warren, 
Brookshire, Ball, & Reynolds, 1960) 
fail to find these effects in fish and 
chicks, More comparative data will 
have to be collected before we can 
speculate about the phylogenesis of 
cognition. 


OVERLEARNING AND TRANSFER 


In applying the notions developed 
above to the effects of overlearning on 
transfer of training, we shall proceed 
cautiously before invoking cognitive or 
Structural explanations. We shall first 
review some of the typical findings in 
the literature, then outline the major 
variables apparently involved in the 
phenomena, and finally examine the ex- 
tent to which associative factors may 
account for them, before invoking ana- 
logic structures as operative in these 
situations. The major independent 
variable with which we will be con- 
cerned is the degree of training on the 
original task, in particular when such 
training goes beyond mere mastery of 
the original task. We will be pri- 
marily interested in transfer situations 
which exemplify the Müller-Schumann 
paradigm, the A-B, A-C situation 
where identical or highly similar stim- 
uli are used in both tasks but where 
the required response varies from the 
original to the transfer task. Over- 
learning beyond the attainment of 
asymptotic behavior provides repeated 
trials which are topographically highly 
similar from occasion to occasion, i.e., 
the same sequence of behavior occurs 
again and again. However, in some 


situations such as the learning of a 
complex maze by a rat or the learning 
of a paired-associate list bv a human 
subject, it is difficult to specify when 
asymptote has been reached. In the 
case of the rat, some choice points will 
be learned very early and while the 
animal may still be making errors at 
some other choice points, he is at the 
same time overlearning the earlier ac- 
quisitions. In the case of verbal learn- 
ing, e.g., of a list of 10 paired associates, 
the subject may be overlearning some 
pairs by the time he reaches even a low 
criterion of 3 or 4 correct out of 10. In 
this case, learning the list to a criterion 
of no errors will include some over- 
learning of some of the pairs in the list. 
'Therefore, some data from the litera- 
ture will be presented which are con- 
cerned with relatively lower degrees of 
training of the entire task but which 
probably include overlearning of some 
parts. 

First, we can examine some typical 
results from the literature on the ef- 
fects of degree of A-B training on A-C 
performance. The degree to which any 
one of these examples conforms to the 
ideal A-B, A-C situations does, of 
course, vary. Such variation extends 
from the ideal, where A is in fact an 
identical stimulus situation in both 
original and transfer tasks, to some de- 
gree of similarity between the original 
and transfer task. Similarity may im- 
ply the presence of common elements in 
the two situations or other constants 
across situations which are likely to 
elicit the original response. 

Figure 1 shows some typical results 
from both animal and human studies. 
In each case the learning of the trans- 
fer situation with no prior training has 
been used to estimate the point of zero 
transfer and amount of transfer is 
plotted against number of trials on the 
original task. Performance for groups 
that have had some original learning 
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Transfer of training as a function 
of degree of original learning. (See text for 
explanation.) 


prior to the transfer task is represented 
in terms of percentages of the per- 
formance of the group with no prior 
training. 

One of the most striking aspects of 
these graphs is the Similarity of the 
general function of the relationship be- 
tween transfer and degree of prior 
training. An initial negative transfer 
effect is followed by either a return to 
zero transfer, or the appearance of 
large positive transfer effects. The ef- 
fect is U shaped in all instances. 

The eight graphs in Figure 1 repre- 
sent animal studies on the left and hu- 
man studies on the right. Jackson 
(1932) trained his first group of rats 
to acriterion of 4 out of 5 correct trials 
on a multiple-cul maze. His second, 
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third, fourth, and fifth group learned 
to criterion plus 10, 20, 30, and 40 
overlearning trials, respectively. Trans- 
fer to a similar maze shows an initially 
large negative transfer effect followed 
by a slight positive effect with high de- 
grees of overlearning. Wiltbank 
(1919), in a study using similar ap- 
paratus but lower degrees of learning 
(0, 2, 4, 8, 16 trials, and a mastery 
criterion), obtained highly similar re- 
sults, though less negative and more 
positive transfer, : 

The next graph represents Reid's 
(1953) study, in which animals learned 
a black-white discrimination in a Y 
maze. They show negative transfer 
when reversed to the previously nonre- 
inforced stimulus if they were trained 
only to criterion on the first task. 
However, with 50 overlearning trials 
we find less negative transfer, and posi- 
tive transfer for the group that had 150 
overlearning trials, Pubols (1956), in 
replicating this particular study, finds 
a similar effect though no positive 
transfer for the highly overlearned 
group. Galanter and Bush (1959) 
have presented some data on reversal 
learning which fail to show any positive 
transfer following high degrees (144 
trials) of overlearning, Their animals, 
however, were run under conditions 
of distribution which resulted in much 
more spaced trials than Pubols used. 
Such a condition might be unfavorable 
to the development of stable structures. 

The last graph for the animal studies 
represents a study by Bruner, Mandler, 
O'Dowd, and Wallach (1958). In this 
Study animals were trained to learn 
a left-right alternation sequence in @ 
Our-unit maze, After varying degrees 
of acquisition in this original task, they, 
Were switched to the mirror image ° 
the original task, e.g., the right-left- 
right-left Tesponse. Animals eithef 
learned to a criterion of 80% correct 
or were given 20 and 80 overlearning 
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trials, respectively. Once again we find 
an initial negative transfer effect fol- 
lowed by a large positive transfer effect 
with 80 overlearning trials. The sec- 
ond (dotted) curve in this graph repre- 
Sents a group which was run when 36 
hours hungry, the first group was run 
when 12 hours hungry. 

We can now turn to the human stud- 
les. The first example is a study by 
Siipola and Israel (1933) in which sub- 
jects learned various telegraph codes 
to letters of the alphabet as stimuli. 
This study is not strictly an A-B, A-C 
Situation, It involved the recombina- 
tion of stimuli and responses in the 
transfer task. The same stimuli and 
Same responses as in the original task 
Were used in the transfer task with dif- 
ferent combinations of stimuli and re- 
sponses. Four groups were given dif- 
ferent degrees of training; their means 
were 12, 26, 96, and 208 trials on the 
original task, respectively. The last 
two groups received high degrees of 
overlearning. In this particular case 
there is a slight positive transfer effect 
for the first group which is followed by 
a negative transfer effect and then by 
high positive effects for the last two 
groups. 

Underwood (1949) used lists of 10 
Paired-associate adjectives. The most 
marked effect is the high degree of posi- 
tive transfer with high degrees of over- 
learning, with only little negative trans- 
fer for the group receiving the least 
amount of training on the original task. 
In this particular situation, the first 
group had 3 correct out of 10 pairs, 
the second 8 correct out of 10 pairs, and 
the third group one complete correct 
trial plus five additional trials. The 
absence in this particular case of any 
large negative effect may be due to the 
lack of a warm-up control which we 
will discuss presently. Underwood 
also gives the number of errors that 
Were made during the first two antici- 


pation trials (Trials 2 and 3) of the 
transfer task. These are shown in the 
next graph where we find negative 
transfer effects for the least trained 
group which change to zero or positive 
transfer for the more highly trained 
group. 

The next graph shows a study by 
Mandler and Heinemann (1956) in 
which subjects learned three-consonant 
nonsense syllables as responses to one 
digit numbers. The first overlearning 
group was given 10 trials beyond com- 
plete mastery of the list, the second 30 
overlearning trials, and the third 50 
overlearning trials. Again we find 
initial negative transfer effects followed 
by zero or positive transfer. 

In examining these findings, five 
general processes will be considered: 
warm-up and learning set effects, re- 
sponse competition, stimulus discrimi- 
nation, response learning, discontinu- 
ous structural factors. 

Warm-up and learning set effects 
have been grouped together because— 
in the kinds of design presented here— 
they are usually confounded. Thus, it 
is often not determinable to what ex- 
tent an animal’s prior experience in a 
maze produces varying degrees of spe- 
cific postural and attentive habits 
(warm-up) as against nonspecific 
structural effects (learning set). Pre- 
sumably both kinds of effects are oper- 
ating and may, for present purposes 
only, be considered together. But in 
order to evaluate the effects of over- 
learning on transfer, these more gen- 
eral experiential factors should be 
separated somehow from factors assign- 
able to the competition and facilitation 
between original and transfer re- 
sponses. We can examine the ideal 
model and determine what kind of con- 
trol groups are necessary to separate 
situational and nonspecific transfer ef- 
fects from those transfer effects which 
are peculiar to the juxtaposition of the 
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particular original and transfer tasks. 
The model can be conceptualized as 
follows: 


Original Transfer 
task task 
Ideal paradigm A-B A-C 
Control condition D-E A-C 


The first line, A-B, A-C, is the ideal 
paradigm. However, any experience 
with the general class of tasks which 
A-C represents may result in transfer 
effects not due to A-B alone. In the 
second line the necessary control con- 
ditions are shown. In this case a D-E 
situation is necessary, which must be 
learned to the same degree as A-B was 
learned but which contains neither the 
stimuli of the A-C situation nor the re- 
sponses of either the A-B or the A-C 
situation. In simple paired-associate 
learning this is not difficult to ac- 
complish. Obviously we can have con- 
trol groups which learn a series of 
paired-associate adjectives as in A-B, 
but with adjectives in the stimulus and 
response position which are different 
from either A, B, or C. A brief ex- 
ample of what happens when one does 
or does not use controls for this kind 
of situation is shown in Figure 2 which 
recapitulates the data from Mandler 
and Heinemann (1956) but incorpo- 
rates a control for warm-up and learn- 
ing-set effects. The first graph of the 
figure shows the raw data for the A-B, 
A-C situation, and in the second graph 
is shown the raw data for a D-E, A-C 
situation. (For comparison purposes 
the large positive effects of an A-B, 
C-B condition are also shown.) In 
both cases we find increasing positive 
transfer as a function of overlearning 
on the original task. However, the 
positive transfer for the control condi- 
tion is greater for the first two groups 
and less so for the last group. The 
resultant is shown in the third graph, 
which shows initial negative transfer 
followed by positive transfer. 
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Fic. 2. Effect of warm-up correction On 
transfer effects. (See text for explanation. 


The paired-associate situation is rela- 
tively easy to consider in controlling 
for warm-up and learning to learn ef- 
fects. It becomes more difficult whe? 
we think about some of the anima 
studies. It may be difficult to specily 
a warm-up condition which will sepa- 
Tate practice on a specific maze from 
general maze or situational experience: 
but such control groups seem to be 
essential. 

Some of the studies do suggest poss 
ble control groups. In Reid's (1953) 
study, for example, it might be pos51?" 
to use control groups who learn to dis 
criminate but do not learn a differenti? 
black-white response. Reid argues e 
the acquisition of a new response, E 
response of discriminating, which do 
not occur until late in overlearning: , 
animals were to learn a discriminatio' 


i- 
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between a triangular and a circular 
stimulus and given various degrees of 
iraining on this discrimination before 
being transferred to a black-white dis- 
crimination, we would then have a con- 
trol group which does not have the 
black-white practice, but does have 
practice in discriminating. Any result- 
ant advantage which still accrues to 
the group that had its prior practice on 
the black-white discrimination would be 
specific to that particular situation and 
not related merely to being in a dis- 
crimination task. In general there 
tends to be some confusion between 
specific transfer effects and general 
transfer effects due to warm-up or 
learning to learn. Transfer studies 
must find some way of separating gen- 
eral from more specific transfer effects. 

The first of the specific transfer ef- 
fects to be discussed is the classical one 
of response competition or interference. 
The response which was appropriate in 
the original task is elicited in the trans- 
fer task, particularly if the stimulus 
situation in the two tasks is highly 
similar, This effect has led past in- 
vestigators to call the A-B, A-C para- 
digm the negative transfer paradigm 
par excellence. Despite the failure to 
find such a result in the overlearning 
studies, the available evidence in trans- 
fer studies suggests that the interfer- 
ence or competition hypothesis need 
not be abandoned. What appears to 
be the case is that interference effects 
occur very early during the transfer 
task and that "factors making for posi- 
tive transfer do not gain the 
ascendancy over the negative tenden- 
cies until the latter stage of the learn- 
ing process" (Wiltbank, 1919, p. 42). 

The important point is that the prob- 
ability that Stimulus A, which is pres- 
ent in both situations, will elicit Re- 
sponse B, which was learned in the 
original task and which interferes with 
the performance of Response G me 


quired in the transfer task, does in- 
crease as a function of prior A-B pair- 
ings. But, despite this increase in 
intrusion. errors, the overall efficiency 
in learning the transfer task is in- 
creased. s 

Underwood's (1949) data show 
large interference effects during the 
first two anticipation trials, probably 
primarily due to intrusions of the now 
erroneous response during early learn- 
ing of the transfer task. In animal 
studies, Reid (1953) has shown that 
his highly overtrained animals make 
many more intrusion errors, i.e., going 
to the now incorrect stimulus, than do 
the less trained animals. The Bruner 
et al. (1958) study also indicates that 
the amplitude (in time) of the now in- 
correct "left" response at the first 
choice point of the transfer task in- 
creases as a function of degree of orig- 
inal training—though in the 12-hour 
deprived group only. 

As far as stimulus differentiation 
is concerned, the more experience the 
subject has had in differentiating the 
stimuli used in transfer, the less likely 
we are to find intratask competition 
among various stimuli (Gibson, 1940, 
1953). But whenever the stimulus 
situation in the two tasks is identical, 
as in the A-B, A-C paradigm, stimulus 
differentiation is only relevant to the 
degree that the subject has learned a 
consistent response to the stimulus in 
the original task—nothing new needs 
to be learned about the stimulus in the 
transfer task. 

More important than stimulus learn- 
ing is response learning. The degree 
to which the organism "knows the re- 
sponse" is an important factor in his 
performance of the transfer task. If 
that task requires a new response 
which has not been previously practiced 
and integrated, the early trials of the 
transfer task will be devoted to re- 
sponse learning as well as to the elimi- 
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nation of competition with the re- 
sponses of the original task.* Special 
attention must be paid here to extra- 
experimental experience. Responses 
may well have been learned prior to the 
transfer task in situations other than 
the original task. For example, in 
paired-associate learning with adjec- 
tives it is obvious that whatever ad- 
jectives are used in the transfer task, 
the subject has had some prior ex- 
perience in learning how to make the 
responses—a condition not applicable 
to some nonsense syllable learning ex- 
periments where the Subject has to 
learn the response de novo. 

Before considering discontinuous 
cognitive effects, we shall summarize 
the hypothesized effects of the other 
four factors on the transfer task as a 
function of degree of training on the 
original task. 

l. We may tentatively assume that 
warm-up activity and learning sets both 
have a monotonic facilitating effect on 
subsequent behavior, 

2. Degree of prior learning of an “in- 
compatible response" has a monotoni- 
cally increasing interfering effect. It 
also appears to be the case that these ef- 
fects, while manifested in intrusion er- 
rors, do not necessarily hinder the 
rapid acquisition of the new response, 

3. Stimulus discrimination also 
seems to have a monotonic facilitating 
effect on subsequent learning. 

4. Finally, response learning facili- 
tates subsequent acquisition of a new 
response to the extent to which the re- 

quired response has been previously 
practised. , 

There seems to be some evidence 
that Bair's (1902) and MENS 
(1919) conclusion that negative effects 

iti n our terms) 
(response competition i 


4 Underwood and Schulz (1960) have elab- 
ated earlier suggestions concerning the im- 
pea of this variable in verbal learning. 
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are maximal during early learning and 
that positive effects do not appear until 
the later stages of learning is essentially 
correct. What may also be the case 15 
that during the later stages of acquisi- 
tion or performance new responses are 
learned which can be transferred to the 
new task. 'The "observing response 

or "response of discriminating” de- 
scribed by Wyckoff (1952), Reid 
(1953), Pubols (1956), and others, 
represents the learning of a new re- 
Sponse which is easily transferred to 
the new situation and which, as Pubols 
suggests, is acquired during the later 
stages of the learning process. How- 
ever, a recent study by Birch, Ison, and 
Sperling (1960) using single stimulus 
presentations showed essentially similar 
results for the effects of overlearning 
on discrimination reversal. They con- 
clude that their findings “effectively 
tule out interpretations . . . in terms 
of responses of discriminating simul- 
taneously presented discriminanda" (P 
40). Birch et al. also appear to dis- 
pose of the suggestion by D'Amato 
and Jagoda (1960) that “overlearning 
should not facilitate reversal learning 
in a successive discrimination situa- 
tion, since avoidance of S. is main- 
tained by the very nature of the 
situation” (p. 259). 

If we apply these various factors tO 
the studies summarized in Figure 1, one 
Possible conclusion is apparent. 
negative factors such as response com- 
petition are dominant after low de- 
grees of original learning, and if posi- 
tive factors such as warm-up and learn- 
ing set do not appear in full force until 
later Stages of learning, i.e., during 
Overlearning, then an initial negative 
transfer effect followed by a reversa 
in a positive direction, after the organ- 
ism has had longer experience with thé 
original task, is predictable. Such a? 
explanation need not invoke any cogn 
tive discontinuous processes, Wilt- 
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bank's, Jackson's, Reid’s, and Pubols' 
studies all might be explained in terms 
of specific learning occurring during the 
original task. The experiment by 
Bruner et al. (1958) designed specifi- 
cally to demonstrate the acquisition and 
use of rules, is probably also a case of 
response transfer, since an integrated 
right-left-right-left sequence could eas- 
ily be transferred to a left-right-left- 
right sequence—large chunks of the re- 
quired behavior are practiced during 
original learning and probably well 
integrated during overlearning of the 
original task. A more crucial question 
would be raised if a control group 
similar to the one suggested for the 
Reid study were included. If animals 
were trained on a circle-triangle dis- 
crimination and then transferred to a 
black-white discrimination, their be- 
havior on the black-white task would 
provide a baseline for evaluating rever- 
sal effects. Thus, once reversal data 
are adjusted for the effects of varying 
degrees of prior maze experience, it 
will be possible to state whether the 
overlearning reversal effects can be ex- 
plained in terms of maze-specific be- 
havior or whether animals during over- 
learning acquire something other than 
warm-up, learning sets or discriminat- 
ing responses, namely some cognitive 
Structure about black and white stimuli. 

In the situation where the subject 
learns to make a new response to an 
old stimulus, the mere learning of the 
old response is irrelevant. If we fur- 
ther assume that stimulus discrimina- 
tion is not a major factor contributing 
to transfer, we are left with three ef- 
fects. Response competition, the major 
negative effect, should increase to 
asymptotic levels asa function of prior 
pairings of old stimulus and old re- 
sponse. Warm-up and learning-set ef- 
fects should increase more markedly 
during the later (overlearning) trials 
and produce positive transfer effects. 


The learning of new nonspecific re- 
sponses (e.g. observing responses) 
probably also appears late during origi- 
nal learning and has positive effects. 
However, once the appropriate control 
groups are used, all of the positive ef- 
fects should be controlled and we 
should be left only with the negative 
effect of response competition. When 
these negative effects change to zero or 
positive effects despite the use of the 
appropriate controls (as in Mandler, 
1954b; Mandler & Heinemann, 1956; 
and Siipola & Israel, 1933; and possi- 
bly in Underwood, 1949) it is appro- 
priate to invoke some other hypothesis. 
We have suggested that overlearning 
experience with the old response and 
the formation of an analogic structure 
permits the subject to manipulate that 
response, to "think about" the problem 
without making overt errors. Just as 
we assume that the animal in the oddity 
problem can use cognitive structures to 
eliminate the need for overt choices, 
and in discrimination problems can 
manipulate black-white choices covertly 
by inhibiting responses to the "incor- 
rect" stimulus, the subject in the A-B, 
A-C situation may be able to eliminate 
overt B responses. 

Starting off with the assumption of 
a continuity from simple associative 
structures to cognitive structures, we 
have examined one area of research in 
detail to find evidence for the appear- 
ance of cognitive structures. Our in- 
vestigation has uncovered a variety of 
associative factors which are adequate 
to explain what may at first glance ap- 
pear to be discontinuous behavior. This 
will, of course, be no surprise to the 
committed S-R theorists who have al- 
ways maintained that most, if not all, 
observations on apparently cognitive, 
categorical behavior will, in the long 
run, be amenable to an associationistic 
explanation. While we would agree 
that sophisticated associationist posi- 
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tions (e.g., Miller, 1959; Spence, 1956) 
have been too readily abandoned by 
their critics, we would maintain that 
some behavioral phenomena defy cur- 
rent  associationistic interpretation. 
However, recourse to cognitive struc- 
tures should await the verdict of the 
laboratory; the proper use of some oí 
the control conditions suggested above 
and a more fine-grained analysis of 
warm-up effects and learning sets will 
determine which of the phenomena dis- 
cussed here will ultimately call for 
cognitive explanations. 


SUMMARY 


Given the distinction that associa- 
tionist and cognitive theories make in 
respect to the definition of response, 
we have examined the proposition that 
one source of cognitive behavior may be 
found in associative antecedents. The 
argument for a transition from associ- 
ations to cognitions was buttressed by 
reference to the development of learn- 
ing sets and developed out of the no- 
tion of response integration and the 
emergence of analogic structures. Em- 
pirical evidence on the effect of over- 
learning on transfer of training was 
examined in detail to differentiate find- 
ings that seem to be amenable to as- 
sociative explanations from those that 
seem to call for structural, cognitive 
explanations. 
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STIMULUS-INTENSITY EFFECTS IN RESPONSE 
EVOCATION' 
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It is a well-established fact that the 
strength of a response varies directly 
with the intensity of its eliciting 
stimulus. The relationship holds for 
a wide variety of reactions and within 
the range of stimulus intensities from 
the lower absolute threshold to the 
level at which the receptors suffer 
physical damage. Despite the wide- 
spread recognition of the phenomenon, 
two basic problems remain unsolved. 
First, it is not known exactly what 
property of the stimulus situation 
causes the effect, i.e., there is no 
precisely articulated theory of stim- 
ulus-intensity effects which is firmly 
established. Second, the question 
arises as to whether the amount of 
true learning, as distinct from per- 
formance, varies with the intensity of 
the eliciting stimulus during training. 
Beginning with an outline of the two 
theories advanced to explain stimulus- 
intensity effects, the following discus- 
sion deals briefly in that context with 
attempts to assess the role of stimulus 
intensity in learning; the greater part 
of the paper is then taken up with a 
consideration of some empirical evi- 
dence on response evocation which 
seems to allow a choice between the 


two theories. 
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THEORIES oF SrtMULUS-INTENSITY 
EFFECTS 


The first theorist to deal with the 
stimulus-intensity problem in formal 
terms was Hull (1951). In typical 
fashion he began by making it the 
subject of a postulate and, in effect, 
assumed a constant relationship be- 
tween response strength and stimulus 
strength, other things being equal. 
He named the effect "stimulus-inten- 
Sity dynamism" and represented it in 
his behavior system with the interven- 
ing variable V. "Dynamism" then 
became the Stimulus-intensity com- 
ponent of reaction potential, acting 
multiplicatively with the other posi- 
tive factors in the system. Hull 
(1952) later proceeded beyond the 
mere assumption of a relationship to 
à more elaborate theory in which he 
attempted to explain the effect with 
the concept of the molar stimulus 
trace. According to this formulation, 
the physiological effect of stimulation, 
Occurring when a brief stimulus im- 
Pinges upon a suitable receptor, 
reaches a maximum about 450 milli- 
Seconds after the beginning of the 
Stimulus. Hull thereby limited the 
stimulus-dynamism process to the 
onset of a stimulus and to short-la- 
tency responses such as the eyeblink. 
He also gave the appearance of allow- 
ing for the effect of stimulus intensity 
9n amount learned by distinguishing 
between Vi, the dynamism of the 
Stimulus during the learning of the 
response, and Vs, the dynamism 9 
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the stimulus during the later evocation 
of the response. At one point Hull 
(1951) did suppose that habit strength 
during learning depended on Vi, but 
he later (1952) shifted to the idea that 
reaction potential during training in- 
volved V; asa component. It would 
appear that, in his later theorizing, 
Hull confined V to the strength of the 
signaling stimulus trace as dependent 
on the age of the trace instead of on 
the intensity of the stimulus. Thus 
he implied that Vi affected actual 
learning through stimulus-response 
asynchronism rather than through the 
strength of the eliciting stimulus. 
Hull was not clear on this point, but 
the final version of his theory does not 
seem to allow the deduction that 
amount learned increases with the 
intensity of the actual stimulus. 

The only other theory of stimulus- 
intensity effects has been proposed by 
Perkins (1953) and Logan (1954). 
Working independently these con- 
tributors arrived at theories which are 
so similar that they will be treated as 
one for the present purpose. This 
theory is based on the assumption 
that, as a given response is being 
elicited by a given stimulus with rein- 
forcement, so the stimulus gains exci- 
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tatory properties which generalize to 
similar stimuli ; conversely, the theory 
assumes that since the occurrence of 
the same response to other stimuli in 
the situation is not reinforced, then 
these other stimuli gain inhibitory 
properties which also generalize. The 
latter class of stimuli is said to be 
made up of "background cues" or 
intertrial stimuli, or to constitute the 
"contextual | environment." This 
state of affairs, the result of differ- 
ential conditioning, is represented 
schematically in Figure 1. If a tone 
is used as a conditioned stimulus (CS) 
in a classical conditioning situation, 
for example, then it is assumed to 
acquire the same degree of gross exci- 
tatory potential in separate groups 
trained with a weak tone (W in Case 
A on the left) and a strong tone (S in 
Case B on the right), and this gener- 
alizes along the stimulus-intensity 
dimension as shown. When a re- 
sponse is given to the intertrial 
stimulus of “silence” (0, the ambient 
noise level) during this training period, 
it is not reinforced and the intertrial 
stimulus acquires inhibitory potential 
which also generalizes. The further 
assumption that inhibition is to be 
subtracted from gross excitation to 
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Schematic diagram of the hypothetical gradients of excitation (continuous line) 


and inhibition (broken line) after reinforcement of responses to the onset of a weak tone (W) 
as CS (Case A) and a strong tone (S) as CS (Case B) with nonreinforcement of responses to the 


intertrial stimulus of "silence" (0). 
for the weak CS—Ew.) 


(Net excitation is greater for the strong CS—E ,—than 
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give net excitation allows the Perkins- 
Logan theory to predict greater net 
excitation for a strong tone (Es) than 
for a weak tone (Ew), with the 
consequent expectation of a stronger 
response in the former case. 
The Perkins-Logan theory assumes 
that amount learned is independent 
of the intensity of the eliciting stim- 
ulus. If it were to be shown that 
learning is superior with a stronger 
CS then the case for the deduction 
made above would be strengthened, 
gross excitation being greater with the 
strong than with the weak tone. A 
later deduction taking drive level into 
account, however, would be altered in 
this case. 'The present application of 
the theory also assumes that the inter- 
trial or "background" stimulus ac- 
quires just enough inhibitory potential 
to prevent a response occurring be- 
tween reinforced presentations of the 
stimulus; this should be the case, 
because as enough inhibition to block 
the response is generated so the re- 
Sponse will no longer occur unrein- 
forced and no more inhibition can 
accrue to the stimulus. It will also be 
noted that the generalization gradi- 
ents have been depicted as linear, 
following Perkins (1953). This has 
been done in the present instance be- 
cause it is felt that the exact shape is 
yet to be firmly established, Perhaps 
the weight of evidence favors the 
existence of negatively accelerated 
gradients (e.g., Raben, 1949) and one 
of the following deductions is made on 
this basis, but the remainder of the 
argument is not significantly affected 
by the assumption of a variety of other 
types of gradient. | 
"Further consideration of the ac- 
count of the Perkins-Logan theory 
given above, read in conjunction with 
Figure 1, may lead to the inference 
that stimulus-intensity effects are to 
be understood as the result of dis. 
crimination learning. In teims of the 
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example depicted in Figure 1, a strong 
stimulus produces a stronger response 
not as a result of any inherent dynam- 
ism but rather because its presence i$ 
more readily discriminated from its 
absence. This parallel may be drawn 
in certain instances and will be con- 
sidered wherever possible in the fol- 
lowing discussion, but it is an inexact 
analogy in some respects and is there- 
fore potentially misleading. Basi- 
cally, the Perkins-Logan interpreta- 
tion of the stimulus-intensity situation 
is put in terms of differential con- 
ditioning rather than traditional dis- 
crimination learning; i.e., the interest 
lies not in which of two or more 
responses will occur to a given stim- 
ulus, nor in which of two or more 
stimuli will elicit a given response, but 
in how Strongly a given response will 
occur when elicited by a given stim- 
ulus. To put the matter another 
way, although the effect on a given 
Tesponse of variations in the strength 
of its eliciting stimulus depends upon 
generalized inhibition from the inter- 
trial or background stimulus, these 
two stimuli are not necessarily pre- 
sented simultaneously and no discrim- 
ination is therefore required. In the 
example cited above there is temporal 
9r successive contrast between the 
positive CS and the negative intertrial 
stimuli. The case of a rat learning to 
Tun to a black card on a white back- 
ground, with spatial or simultaneous 
contrast of "cue" and backgroum 
stimuli, to which the Perkins-Logan 
theory has also been applied (e£ 
Nygaard, 1958), resembles true a 
crimination learning more closely. 
is still a form of spatial learning, 
however, for the cue and backgroun 
stimuli hold constant positions 1” 
Space and are not reversed in the 0n€ 
learning series for a given animal. He 
A final point of interest in p 
exposition of the differential condi; 
tioning theory is the probability tha 
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it applies to variations in the quality 
as well as in the intensity of the stim- 
ulus. Predictable effects should 
ensue whenever the eliciting and back- 
ground stimuli lie on some common 
qualitative dimension, with intensity 
held constant. There is no reason 
why experiments should not be con- 
ducted to test for stimulus-quality 
effects, but with the possibility con- 
sidered, the present discussion will be 
limited to intensity effects. 


STIMULUS INTENSITY AND LEARNING 


The second of the two problems 
stated at the outset concerned the 
possibility that a response is better 
learned if the eliciting stimulus is 
strong during learning. A number of 
attempts have been made to investi- 
gate this possibility, but no satis- 
factory test has yet been devised, so 
that the problem remains unsolved 
and a challenge to future workers. 
For this reason the problem cannot be 
pursued at length now, although a 
brief account of its nature may clarify 
the discussion which follows. 

The latest attack on the problem 
formed part of a study of eyelid con- 
ditioning conducted by Walker (1960), 
which may be used for illustrative pur- 
poses. Following the lead of Grant 
and Schneider (1948, 1949), Walker 
used a factorial design ; she trained one 
group of subjects with a strong CS and 
another group with a weak CS, and 
then shifted half of each of these 
groups to weak and strong CSs, re- 
spectively, in extinction, leaving the 
other half of each group on the CS 
strength used in training. The nature 
of this experimental design is repre- 
sented in Table 1, where the first 
letter in each cell stands for CS 
intensity during training, the second 
letter stands for CS intensity during 
extinction, and the measures are taken 
on the extinction series. Walker 


TABLE 1 


Tue FACTORIAL DESIGN USED 
BY WALKER (1960) 


Extinction CS intensity 
Training CS 


intensity 

Strong Weak 
Strong SS SW 
Weak WS | WW 


sought to determine the effect of CS 
intensity on habit strength by com- 
paring the performance in extinction 
of the two training groups, repre- 
sented in the two rows of Table 1, but 
there proved to be no significant 
difference. 

Unfortunately, this aspect of Walk- 
er's (1960) experiment is rendered 
invalid by the possible interfering 
effects of the shifts in CS intensity 
from training to extinction. which 
occurred in half the subjects. The 
impact of these shifts may be treated 
in terms of both the Hullian and Per- 
kins-Logan theories. Taking Hull's 
theory first, the shifts should bring 
changes both in generalized habit 
strength due to stimulus-intensity gen- 
eralization and in stimulus-intensity 
dynamism (from Vj, to V). The 
former effect would always cause a 
loss of response strength whereas the 
latter factor could produce either an 
increase or a decrease therein. These 
two factors might obscure any effect 
of Vi on sMpr and until some deter- 
mination is made of their differential 
weights in contributing to s; no 
allowance can be made for them. It 
is thus impossible to infer from Walk- 
er's data that amount learned is 
independent of CS intensity. Walker 
did not make this inference; she sug- 
gested instead that extinction data 
may not provide a proper basis for 
testing the hypothesis but failed to 
give any reason for the suggestion. 

Further reference to Figure 1 will 
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w the Perkins-Logan theory 
ts one to believe. that shifts in 
CS intensity after training might ob- 
scure any effect on amount learned. 
On the basis of this theory the ex- 
pected performance of Walker's (1960) 
four groups in extinction (Table 1) 

may be set down thus: WW-—a con- 
tinuous decline due to inhibition 
accruing to the CS as well as to the 
intertrial stimuli; WS—as for WW ; 
SS—a continuous decline from a 


higher level of performance than that ` 


initially present in WW and WS; SW 
—a sharp decrease with the shift, 
from the same higher level, followed 
by a slower decline. If these differ- 
ential effects are to be expected with a 
null hypothesis that CS intensity does 
not affect learning then it is clear that 
no alternative hypothesis may be sub- 
jected to a proper test. Recalling 
that Walker obtained no significant 
effects in extinction, with uncondi- 
tioned stimulus (UCS) intensity held 
constant, it might seem that the pre- 
dictions derived from the Perkins- 
Logan theory were not verified. It is 
doubtful, however, that Walker's 
analysis of the data did justice to the 
predictions, no account being taken, 
for example, of trends Over successive 
trials in extinction. On the other 
hand it is conceivable that the dis- 
crepancy is due to the occurrence of 
superior learning with a stronger CS, 
To return briefly to the analogy 
with discrimination learning, the shifts 
in CS intensity from training to test 
which are used in this type of study 
are reminiscent of the “transposition 
phenomenon." The analogy is still 
inexact, for the original form of trans- 
position required the preservation of 
a relationship between two stimuli 
when the absolute intensity of both 
was changed; argument then took 
place as to whether the relationship 
was vital to the learner (gestalt 
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theory) or to the theorist (S-R theory). 
In the factorial studies of stimulus- 
intensity effects, the absolute value of 
only the CS or cue stimulus is changed 
and that of the intertrial or back- 
ground stimuli remains constant. 
Such experiments could be conducted, 
of course, with shifts in the intensity 
of both sources of stimulation, so as 
to approach true transposition more 
closely. . 
This short account of the possible 
role of stimulus intensity in learning, 
with Walker's (1960) study as an 
example of relevant experimental 
evidence, should suffice to lend weight 
to the claim that no satisfactory test 
of this aspect of the stimulus-intensity 
phenomenon has yet been devised. 
It is therefore fortunate that a clear 
distinction may be made between 
effects on learning and on perform- 
ance. The remainder of this paper 1s 
accordingly devoted exclusively to 
effects on Performance; it is hoped 
that the Preceding examination of 
effects on learning will allow the 


reader to bear this important distinc- 
tion in mind. 


STIMULUS-OrrsET EFFEcts 
ON PERFORMANCE 


In considering the implications of 
his theory, Logan (1954) suggested 
that the offset of a stimulus should be 
as effective as jts onset in eliciting 4 
response and that the strength of the 
Tesponse should vary with the in- 
tensity of the offset stimulus. It is 
difficult to see how a strict interpreta- 
tion of Hull’s stimulus-dynamism 
Principle could predict such an out- 
Come. Hansche ang Grant (1960) 
have shown that offset and onset are 
equally effective as a CS in eyelid 
conditioning, and Schwartz (1958) 
found avoidance response strength in 
Tats to vary directly with degree of 
change in CS intensity, regardless of 
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EXCITATION OR INHIBITION 


TONE STRENGTH 
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Fic. 2. Schematic diagram of the hypothetical gradients of excitation (continuous line) 
and inhibition (broken line) after reinforcement of responses to the offset of a tone (onset of 
"silence") asa CS with nonreinforcement of responses to a weak tone (Case A) or a strong tone 
(Case B) as the intertrial stimulus. (Net excitation is greater with the offset of the strong 
tone—E ,—than with the offset of the weak tone—Em-.) 


the direction of the change. The 
detailed application of the Perkins- 
Logan theory to the case of stimulus 
offset as a CS is represented schemati- 
cally in Figure 2, in which the symbols 
are the same as for Figure 1. In this 
Situation it is the state of "silence" 
which should acquire excitatory prop- 
erties, whereas the weak or strong 
intertrial stimuli come to inhibit the 
response. Even so, the net excitation 
with a strong intertrial stimulus (Es) 
is still greater than that with a weak 
intertrial stimulus (Ew) after the 
excitation and inhibition have been 
built up in the course of training. 
Before any differential conditioning, 
on the other hand, there should be no 
difference in the strength of the re- 
sponses elicited by the two degrees of 
offset. Anexperiment was conducted 
to test these deductions; it allowed the 
findings of Hansche and Grant, and of 
Schwartz, to be confirmed, but made 
wider generality of application pos- 
sible by the use of the long-latency 
GSR. 
EXPERIMENT I 


'The subjects were 24 male and 24 female 
volunteers from an introductory course in 
psychology at the University of Sydney. The 
intertrial stimulus was a 2000-cps sine-wave 


tone delivered to the subject through head- 
phones at sound pressure levels of 80 db 
(strong) or 60 db (weak) and the CS was the 
offset of the tone for 2000 milliseconds. An 
attempt was made to choose sound levels such 
that the loud tone would not be so intense as 
to disturb the subject when presented almost 
continuously for 20 minutes and that the soft 
tone would not be so weak as to be below 
threshold for any subject. The UCS was a 
90-cps spike-wave shock of 500-millisecond 
duration which was presented 500 milliseconds 
after the offset of the intertrial stimulus as CS. 
It was set at 2.27 milliamperes as measured 
with a 15,000-ohm resistor in place of the 
subject and was delivered through thimble- 
type electrodes of German silver .5 inch in 
length, attached to the third and fifth fingers 
of the right hand. The GSR was picked up 
with electrodes of a similar type 1 inch in 
length, attached to the second and fourth 
fingers of the right hand, through which was 
passed a constant direct current of 100 micro- 
amperes; the response was measured with a 
bridge-type circuit linked to an amplifier and 
ink-writing recorder. Conductive paste was 
used in conjunction with the shock and GSR 
electrodes. The duration of the CS, the UCS, 
and the CS-UCS interval was controlled by 
electronic timers. The subject was placed in 
a sound-reduced room completely separate 
from that containing the experimenter and 
the main apparatus. 

The measure of performance used was the 
amplitude of the GSR on test trials (tone 
offset alone) which were interspersed with the 
training trials (tone offset followed by shock). 
Each series began with a test trial followed 
by one presentation of the UCS alone and a 
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second test trial Then three further test 
trials were given after 1, 2, or 3 training trials 
(third test trial), after 4, 5, or 6 training trials 
(fourth test trial), and after 7, 8, or 9 training 
trials (fifth test trial). Within this arrange- 
ment six orders of trials were used, in which 
test trials were presented after the following 
numbers of training trials: 1, 5, 9; 1, 6; 8; 
2,4,9;2,6,7;3,4,8;3,5,7. Thus the mean 
number of preceding training trials for Test 
Trials 1 through 5 were 0, 0, 2, 5, and 8 trials 
successively. The subjects were trained 
either with the offset of the strong tone as the 
CS (Group S) or with the offset of the weak 
tone as the CS (Group W). 


The inter- 
seconds and de- 
aken to achieve 
tance; it rarely 
eraged about 65 


The performance curves derived 
from the data are shown in Figure 3, 
in which the first point on the two 
curves is based on each subject’s mean 
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MEAN NO. OF TRAINING TRIALS 


Response strength on test trials 
periment I as a function of the mean 
number of preceding training trials for Groups 


5 (strong tone off) and W (weak tone off). 
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response over the first and second test 
trials. The performance of Group S 
Was compared with that of Group W 
before any training trials and a U test 
Showed that the difference was not 
significant (z = 1,51, p = 13).. The 
nonparametric test was preferred be- 
Cause of some evidence of nonnor- 
mality in the distribution of scores. 
To ensure that the groups were com- 
parable at this Stage a U test was also 
applied to the responses on the second 
trial, consisting of the UCS (shock) 
alone. The mean values were 1.57 
and 1.74 micromhos for Groups S and 
W, Tespectively, and the difference 
Was not significant (z = 0.22, p = Al). 
Inspection of the results showed that 
Group S improved more than Group 
Win conditioning and the significance 
of this finding was tested by taking the 
difference between the amplitude of 
the response on the second and third 
test trials for each subject. A U test 
showed that the gain was significantly 
greater for Group S ( = 2.12, 


b = .03). 

The results of this experiment con- 
firmed that the offset of a stimulus 
may be used as a CS and that, in the 
COurse of acquisition, the offset of a 
strong CS comes to elicit a stronger 
response than the Offset of a weak CS, 
as the Perkins-Log. 
It is emphasized that this effect is 
being treated as a matter of perform- 
igure 2), it is not being 
at true learning is greater 
Offset of the stronger CS. 
Principle of stimulus-intensity 
dynamism Was couched purely in 
terms of the absolute strength of the 
eliciting Stimulus, acting through its 
Onset. In formulating the principle: 
he cited evidence that rats jump faster 
to white than to black cards, but a law 
of absolute st; mulus strengths is mani- 
festly inapplicable to the case where 
animals respond More strongly to # 
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black (low intensity) card on a white 
background than to a white (high 
intensity) card on a white background 
(Nygaard, 1958). It is exactly an 
effect such as this, on the contrary, 
which is predicted by the Perkins- 
Logan theory. A further difficulty in 
the application of Hull's theory to the 
present data is its limitation to short- 
latency responses and its failure there- 
fore to explain such effects as have 
been obtained with the GSR. The 
two main results of Experiment I were 
the demonstration that "silence" is 
comparable in effectiveness with an 
audible tone, as a CS, and that the 
response elicited by a brief period of 
"silence" is stronger with a loud tone 
as the intertrial stimulus than with a 
weak tone between trials. The former 
result is comparable with the common 
finding in discrimination learning 
studies that, with a white card and a 
black card presented simultaneously, 
for example, either may be made the 
positive” cue in the sense that ap- 
proach to it is followed by reinforce- 
ment, provided allowance is made for 
mitial position preferences. The sec- 
ond result of Experiment I is presum- 
ably analogous with the equally 
common finding that rate of discrimi- 
nation learning is directly proportional 
to the degree of difference between the 
positive and negative cues, for ex- 
ample, that learning proceeds more 
quickly with white and black cards as 
the cues than with white and gray or 
with black and gray. : 
The results of Experiment I suggest 
a further prediction about generaliza- 
tion effects which follows the applica- 
tion of the Perkins-Logan theory to 
the stimulus-offset situation. It is 
first necessary, however, to deal with 
these effects as they occur in the more 
usual case of stimulus onset (Figure 
1). The clearest evidence on stim- 
as prob- 


ulus-intensity generalization h 


ably been provided by Grice and 
Saltz (1950), who found asymmetrical 
gradients falling rapidly with test 
stimuli weaker than the training stim- 
ulus and more slowly for stronger test 
stimuli. Examination of Figure 1 
shows that, according to the present 
formulation of the  Perkins-Logan 
theory, tests with stronger stimuli 
should not yield a weaker response, 
as net excitatory potential stays 
constant. The reason why this de- 
duction does not correspond with the 
facts may well lie in the assumption of 
linear gradients of generalized excita- 
tion and inhibition. If negatively 
accelerated gradients are substituted 
for the linear gradients of Figure 1 
then tests with stronger stimuli should 
also produce a weaker response. The 
gradients of net excitatory potential 
should still be asymmetrical; Hull 
(1949) predicted this asymmetry by 
allowing for stimulus dynamism, but 
the substitution of negatively acceler- 
ated for linear gradients allows the 
Perkins-Logan theory to predict the 
same outcome. According to the 
latter. theory, however, when the 
eliciting and background stimuli do 
not lie on any common dimension, 
then the gradients of net excitatory 
potential should be symmetrical about 
the value of the eliciting stimulus. 
This lack of communality and the 
consequent symmetry are more likely 
with qualitative differences than with 
changes in stimulus intensity. 

The application of these considera- 
tions to the case of stimulus offset 
gives rise to the prediction which is 
still untested. When the ambient 
noise level ("silence") of, say, 50 db 
is used as the CS then tests of stim- 
ulus generalization may be made by 
briefly changing the intensity of the 
intertrial stimulus to some value 
other than 50 db. For example, the 
intertrial stimulus could be switched 
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from 80 db to 90 db or to 70 db for 
2000 milliseconds (see Case B of 
Figure 2). With negatively accelerated 
gradients assumed, a momentary in- 
crease or decrease in the intensity of 
the background stimulus should pro- 
duce a response which is always 
weaker than the original CR but rela- 
tively stronger with the decrease than 
with the increase. Momentary de- 
creases in the intensity of the intertrial 
stimulus below the intensity of the CS 
(50 db), however, should produce 
another gradient of net excitation. 
Thus a plot of the curve relating net 
excitatory potential to Stimulus in- 
tensity should 
inflection, one at the value of the CS 
and another at the value of the inter- 
trial stimulus. This effect would be 


as well as the 


intertrial 
however 
weak, then the Same effect should be 
would be 


technical 
reasons. 


Mopirrep HuLLIAN THEORY 


The results of Experiment I showed 
that a strict, literal interpretation of 
Hull's principle of stimulus-intensity 
dynamism must be abandoned, but 
they also suggest a modification by 
which Hull's approach to the Problem 
might be sustained. Along with all 
other S-R theorists Hull assumed that 
a stimulus acts through its Occurrence 
or presence and this is the source of 
difficulty in dealing with the cessation 
or “absence” of a stimulus as an active 
factor. The difficulty could be over- 
come with the supposition that a 
stimulus acts through any change in 
the intensity of the physical energies 
impinging upon the organism's recep- 


tors and that the effectiveness of the 
stimulation varies with the degree of 
change, regardless of its direction. An 
assumption of this kind is supported 
by physiological evidence for the 
existence of nerve fibers which respond 
only to changes in stimulation (e.g. 
Granit, 1947). Thus it is possible to 
contrast a neo-Hullian or modified 
stimulus-intensity dynamism principle 
with the differential conditioning 
theory of Perkins and Logan, and 2. 
Second experiment was designed with 
this aim in view. In its revised form 
the stimulus-intensity dynamism prin- 
ciple should ascribe equal effectiveness 
to the onset and the offset of a stim- 
ulus, provided that degree of change is 
held constant, and should not lead one 
to expect any effect with a shift from 
onset to offset (or vice versa) as the 
Stimulus over à series of trials with the 
Same subject. According to the dif- 
ferential conditioning theory, on the 
other hand, once a given stimulus has 
acquired inhibitory Properties for a 
given subject then negative transfer 
should occur if the same stimulus is 
then used to elicit the response in the 
Same subject, as happens with a shift 
t or vice versa. In 
order that due allowance might be 
made for the limitation of the Hullian 
formulation to short-latency re- 
Sponses, this experiment was con- 


ducted with 4 key pressing movement 
instead of the GSR. 


EXPERIMENT II 


The subjects were 12 members of under- 
graduate Courses at 


: te 
Sasured with an electroni 


d Each Subject was presente’ 
With the four combinations of stimulus condi- 
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tions in blocks of nine trials, viz., weak tone 
offset (A), strong tone offset (B), weak tone 
onset (C), and strong tone onset (D). With 
subjects numbered in order of testing, they 
underwent the four conditions in one session 
in the following orders: Subjects 1, 5, and 9— 
CDAB; Subjects 2, 6, and 10—DCBA; 
Subjects 3, 7, and 11—ABCD; Subjects 4, 8, 
and 12—BADC. Before each block of trials 
the subject was told what the nature of the 
signal was to be and was simply asked to press 
the key as quickly as possible upon hearing 
the signal. A verbal ready signal was given 
Over an intercom approximately 2 seconds 
before each stimulus signal and the duration 
of the occurrence or absence of the tone was 
2 seconds. The interval between trials was 
approximately 15 seconds. 

The design of the experiment was such that 
the scores of the 12 subjects could be averaged 
for each trial irrespective of the conditions 
under which they were tested, so as to cancel 
out the main effects due to individual differ- 
ences and to amount and direction of change 
in stimulus intensity as the signal. The 
adoption of this procedure left not only 
Practice as a main effect but also shifts in the 
nature of the signal. Between Trials 9-10, 
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18-19, and 27-28 there were shifts in degree 
of change in stimulus intensity (small to 
large or large to small), but between Trials 
18-19 there occurred a shift in direction of 
change in stimulus intensity (onset to offset 
or offset to onset) as wellas in degree. Mean 
performance for the ordinal series of trials is 
depicted in Figure 4 and it is there clear that 
whereas the shifts in degree of change simply 
merged with the general practice effect, the 
shift in direction as well as degree caused 
considerable negative transfer. Wilcoxon’s 
rank test was applied to a comparison of the 
scores on Trials 18 and 19 and this showed 
that the negative transfer was significant 
(22 3.43, p «.001) These results are 
interpreted as clearly supporting the Perkins- 
Logan theory as against a modified stimulus- 
dynamism postulate. 

The findings of the first experiment and 
other studies were also confirmed in the data. 
Working with the total reaction time over the 
last five trials in the block of nine for each 
subject, in order to reduce the practice effects, 
rank tests were used to assess the main effects 
of weak versus strong stimuli and of onset 
versus offset as a signal. The former effect 
was significant (s = 3.19, p < .01) but the 
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Fic. 4. Performance in Experiment II as a function of the ordinal number of the trial 


regardless of condition of testing. 
Signal occurred between Trials 9-10, 
a shift in direction of change.) 


(Shifts in degree of change in stimulus intensity as the 
18-19, and 27-28, but between Trials 18-19 there was also 
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m e ir hus, Po md disp e 
pada the case of rats, strength of response 
varied with amount of change in stimulus in- 
tensity, regardless of the direction of the 
change. 

The results of the second experi- 
ment nullified the attempt to modify 
Hull’s treatment of the stimulus- 
intensity phenomenon and lent further 
weight to the Perkins-Logan theory. 
Apart from the direct disproof of 
the modified stimulus-dynamism prin- 
ciple, the differential conditioning 
theory should be preferred as the more 
parsimonious, for it avoids the as- 
sumption that a stimulus acts through 
change in intensity rather than its 
mere presence. This is not to deny 
rapid adaptation, so that a stimulus is 
most effective at the moment of onset, 
but it does imply that mere offset 
cannot be regarded as having stimulus 
properties. In terms of the Perkins- 
Logan theory, offset is effective be- 
cause it inevitably means the simul- 
taneous onset of some other physical 
energy as far as receptor processes 
concerned. It should be noted that 
the terms “offset” and “onset” need 
not imply any direct intervention on 
the part of the experiment, as in the 
present experiments, but may be 
applied to changes in the energies 
affecting the receptors which are in- 
duced by the organism's own move- 
ments, as in instrumental conditio: 
or discrimination learning. 

The use of such a precise and well. 
controlled response as key pressing by 
humans in this experiment has drawn 

attention to a point which may well 
demand further study if the differ- 
ential conditioning theory is to find 
wide acceptance. The theory states 
that background or intertrial stimuli 
acquire inhibitory properties with re- 
spect to a given response when that 
response occurs in their presence with- 
out reinforcement. Although it was 
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observed in Experiment I that GSRs 
commonly occurred between trials 
early in training, there were relatively 
few cases of key pressing to the inter- 
trial stimulus in Experiment ll. 
Some subjects who did respond before 
the signal of tone onset or offset 
apologized to the experimenter or acted 
as if they had behaved foolishly; the 
intertrial GSRs, on the other hand, 
would have passed undetected by the 
subject. Thus it may be necessary 
to assume that with such “voluntary 
responses as key pressing it is enough 
for the response to occur implicitly 
without reinforcement for inhibition 
to accrue to the stimulus present at 
the time. This raises the further 
problem of the nature of the reinforce- 
ment for learning to give a ‘‘volun- 
tary” response more and more quickly 
under instruction, No attempt will 
be made to enter into a discussion of 
such issues here, but it is clear from 
Figure 4 that learning did occur in a 
Situation of this type and the immedi- 
ate drop in response strength with a 
reversal of signaling and intertrial 
stimuli is in keeping with the Perkins- 
-ogan theory as it is formulated at 
present, 


Referring again to the comparison 


of the stimulus-intensity situation 
with that of discrimination learning 
the results of Experiment II permit à 
Parallel to be drawn. This is to the 
effect that in discrimination learning 
With white as the positive and black a5 
the negative Cue, negative transfer !$ 
Obtained With a reversal to white 2$ 
negative ang black as positive after 
Some training. Jn terms of Spence ê 
(1937) analysis of the transpost 
tion Phenomenon, furthermore, there 
Should be Some loss in performance 
with a shift from white to gray as epe 
Positive cue, with black negative 
throughout, or from black to graY b 
the positive cue with white constantly 
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negative, because the difference be- 
tween the positive and negative cues 
is less after the shifts; conversely, 
there should be gains in performance 
with shifts of the positive cue in the 
reverse direction, i.c., from gray to 
white with black negative or from 
gray to black with white negative, 
there being a greater difference between 
the positive and negative cues after 
the shifts. Comparable with these 
findings or expectations are the facts 
that the reversal of signaling and 
intertrial stimuli in Experiment II led 
to a marked drop in performance, and 
that level of performance depended on 
the degree of difference in intensity 
between these two stimuli. The lat- 
ter feature of Experiment Il did not 
show up in Figure 4, of course, because 
the design counterbalanced large and 
small intensity differences at succes- 
sive shifts, but it was clearly evident 
when degree of change was studied as 
a main effect, as in Experiment I also. 


STIMULUS INTENSITY AND 
Drive STRENGTH 


The third and fourth studies of this 
series were conducted with a common 
aim, the determination of the general 
nature of the joint effect upon response 
strength of variations in stimulus in- 
tensity and drive strength. As noted 
at the outset, Hull assumed a multi- 
plicative relationship between stim- 
ulus-intensity dynamism (V) and 
drive strength (D), but the Perkins- 
Logan theory should allow a deduction 
to be made in this respect. If the 
excitation accruing to the eliciting 
stimulus and the negative effects of 
generalized inhibition are set down in 
a conventional way, then the deter- 
mination of the strength of the 
response in a variety of situations may 
be represented as 


R= A - für Xx D-1D 


where Æ is net excitatory potential, M 
is habit strength, D is drive strength, 
and Z is inhibition (Spence, 1960). 
The application of this theorizing to 
the case of a response which is elicited 
by both weak and strong values of the 
stimulus, with JZ held constant, leads 
to the deduction that variations in the 
strength of the eliciting stimulus (i.e., 
in degree of inhibition) will act addi- 
tively with variations in D. It is this 
deduction from the Perkins-Logan 
theory, opposed as it is to Hull's 
assumption, which was tested in the 
following two experiments. Because 
of its greater sensitivity to motiva- 
tional effects and its “involuntary” 
nature, use was again made of the 
GSR. A factorial design was used to 
test for the additive or multiplicative 
relationship, it being assumed that the 
latter would be reflected in significant 
interaction between the variables. 
Instead of using stimulus offset as 
the CS in a situation requiring new 
learning (as in Experiment I), capital 
was made of the fact that, due either 
to transfer from past experience or to 
the nature of the innate reflex, most 
subjects give a GSR to a tone without 
formal training, the response to a 
strong tone being greater than that to 
a weak tone. While many subjects 
also give GSRs between trials at first, 
the intertrial responses soon fall to a 
minimum frequency. The experiment 
therefore proceeded straight to an ex- 
amination of the relation between this 
existing stimulus-intensity effect and 
drive strength. Variation in drive 
strength was achieved through the 
strength of shocks administered at 
random among the test presentations 
of the weak and strong tones, a tech- 
nique used in a number of recent 
studies for this purpose (e.g., Spence, 
Haggard, & Ross, 1958). The re- 
peated presentation of a tone as an 
eliciting stimulus for the GSR without 
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reinforcement, however, produces the 
decrement in performance usually 
ascribed to either or both of the proc- 
esses of adaptation and experimental 
extinction. In order to carry out a 
sufficient number of test trials (tone 
alone) it was necessary in one of the 
two experiments to interpolate train- 
ing sessions among test sessions, The 
training sessions consisted of regular 
conditioning trials, with tone as CS 
and shock as UCS, The separate 
test series in this case each required an 
appropriate combination of tone 


ength, but care 
ining series to 


¢ OSting" session 
included all combinations of CS and 


S intensity in order that no arti- 


follows. Let Eww, Ews, 
Ess represent the degree of net excita- 
tory potential generated with the 
combinations of weak tone and weak 
shock, weak tone and Strong shock, 
strong tone and weak shock, and 
strong tone and strong shock, respec- 


Esw, and 


tively. Then 
Eww = H X D— =I 
Ews = HX D+ wets 
Esw = HX D— — ]— 
and 


Ess = HX D+ — T— 


where Z is constant, D+ and D— are 
due to strong and weak shock, re. 
spectively, and 7+ and I— are the 
values of the generalized inhibition 
present with weak and strong tones, 
respectively, as eliciting stimuli. This 
state of affairs is represented in 
Figure 5. 
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Pig, 5. The application of the Perkins- 


Logan theory to the covariation of stimulus 
strength (tone) and drive strength (shock). 
(The broken line indicates the gradient of 
inhibition, originating at the intertrial stim- 
ulus—O—and generalizing to the weak tone— 
1-4-—and Strong tone—/ —. The E values 
are for the four combinations of weak or 
Strong tone— first subscript—with weak or 
strong Shock—second subscript.) 


The use of a factorial design allowed 
a test of an additive relationship in 
the absence of interaction between the 
Variables, Ag Set out in Figure 6, 
this test may be made by comparing 
the difference between the mean GSRs 
for Strong and weak shock with a 
Weak tone (WS-WW in Figure 6) with 
the difference between means for 
strong and weak shocks with a strong 
tone (SS-SW), "he comparison may 


EXCITATION OR INHIBITION 


Fic. 6, 
additive 


relationship 


(The letters indicate 


mbinations of weak or strong tone 
rst letter—, 


with Weak or strong shock— 
Second letter) 
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also be made between the differences 
for strong and weak tone with a weak 
shock (SW-WW) and for strong and 
weak tone with a strong shock 
(SS-WS). The specific predictions 
from the Perkins-Logan theory are 
obtained thus: 


Ews S» Eww a " 
= JID+ —I+ —HD-—+1+ 
= [I(D+ — D—) 

and 


Ess = Esw = e 
= HD+ —I- —HD-—-1— 
= H(D+ — D—) 


when it is to be expected that the 
differences will be equal and that 
there will be an additive relationship 
such as is depicted in Figure 6. It 
also follows that both Esw — Eww 
and Ess— Es are equal to [+ —-I-. 


EXPERIMENT III 


The subjects were one man and one woman. 
The apparatus differed from that used in 
Experiment I in the following respects. The 
eliciting stimulus was a 2000-millisecond, 
2000-cps sine-wave tone delivered to the sub- 
ject through headphones at 60 db (weak tone), 
75 db (medium tone), or 90 db (strong tone) 
SPL. ‘The noxious stimulus used to induce 
drive was a 500-millisecond, 90-cps, spike- 
wave shock delivered through copper elec- 
trodes in the form of spring clamps $ inch 
wide which almost encircled the third and 
fifth fingers of the right hand near the finger- 
tips, and it was set at 2.27 (weak shock), 3.20 
(medium shock), or 4.80 (strong shock) milli- 
amperes. After being placed in the apparatus 
the subject was merely told to keep still and 
warned of the first shock and the first tone. 
Both subjects underwent a number of test 
sessions at the rate of one each day at the same 
hour (11:00 A.3.-12:30 P.M.) and each session 
consisted of two shock trials, six tone and six 
shock trials in random order, and one tone 
trial Shock strength and tone strength were 
held constant throughout any one test session 
but were varied from session to session in 
accordance with the basic aim of the experi- 
ment. The test sessions for the female sub- 
ject involved the six combinations of week 
(W), medium (M), and strong (S)tonewit 


weak (W) and medium (M) shock, and the 
order of combinations over sessions was WW 
(weak tone and weak shock), SM, WM, SW, 
MW, and MM. This order did not seem to 
allow any biasing effect due to adaptation and 
if anything appeared to favor the occurrence 
of a direct multiplicative relationship. The 
test sessions for the male subject involved the 
four combinations of weak and strong tone 
with weak and strong shock. In this case it 
was possible to use the more desirable ABBA 
sequence and the combinations were used in 
the order SS, SW, WS, WW, WW, WS, SW, 
and SS. Twenty-four hours before each test 
session (following the previous test session) 
the subject was given a training series of nine 
conditioning trials (tone-shock with a 500- 
millisecond interstimulus interval) to combat 
adaptation and extinction effects, and each 
training series included all the combinations 
of tone strength and shock strength. 


The data for each subject were 
treated separately and they are pre- 
sented in Table 2. The results of an 
analysis of variance are set out in 
Table 3. In neither case did the tone- 
shock interaction, to be expected with 
a multiplicative relationship, prove 
statistically significant. 


EXPERIMENT, IV 

The subjects were 20 volunteers (9 men and 
11 women) from an introductory psychology 
course at the University of Sydney. With 
two exceptions the apparatus and stimuli were 
identical with those used in Experiment IH. 
The copper shock electrodes were replaced 
with a type similar to those used to pick up 
the GSR, and only the 1.60-milliampere and 
3.20-milliampere shocks were used, being des- 

SST 
TABLE 2 


Mean GSR ro TONE (micromhos) 
IN EXPERIMENT. III 


uam 


Shock strength 


mondd oa 23 c2 NNNM 


Subject | strength 


Strong | Medium | Weak 

Strong — 4.79 2.60 

Female | Medium — 2.74 1.26 
Weak — 2.91 0.16 

Male Strong 1.30 0.27 
Weak 0.52 — 0.07 


TABLE 3 


ALYSIS OF VARIANCE 
XPERIMENT III 


SUMMARY OF / 
or GSRs ix 


Subject Source df | MS F 
Tone (A) 2 | 1797 | 3.24 
Female | Shock (B) 1 | 4864 | 8.78** 
AXB 2 139 | 0.25 
Residual 36 | 554 
Tone (A) 1| 340 | 4.05* 
Male Shock (B) 1 757 | 9.01** 
AXE 1 109 | 1.30 
Residual 52 84 
x -05. 
sb EO 


ignated weak and strong, respectively, hence- 
each subject were 


i ‘isting of one medium 
tone trial, two shock trials, and, in a different 


random order for each subject, five weak tone, 
five strong tone, and eight shock trials. The 
subjects were matched in pairs on the basis 


and the other with 
whereas shock strength was constant d 
the session for any one subject, both s 


trong 
tone and weak tone trials were given to all 
subjects, 


Because each subject was tested in 
one session, with only 11 tone alone trials, 
adaptation and extinction effects were rela- 
tively slight and no training trials were used. 
Once again use was made of the facts that 
subjects enter this situation with a tendency 
to respond to the tone and that intertrial re- 
sponses are soon inhibited. In view of the 
matching of subjects across shock strengths 
the data were treated as if the same subjects 
had undergone each combination of stimulus 
strengths and the results of an analysis of 
variance are shown in Table 4. Again the 
tone-shock interaction was not significant, as 
might be expected from an inspection of the 
data in Table 5. 


The data of Experiments III and IV 
confirmed the deduction from the 
Perkins-Logan theory and were in- 
compatible with the Hullian assump- 
tion of a multiplicative relationship 
between stimulus strength and drive 
strength. Another aspect of Walker's 
(1960) study, cited earlier, produced 
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TABLE 4 


SUMMARY oF S OF VARIANCE 


or GSRs iN 


Source | dí | MS I 
Tone (A) | 1 | 42.69 
Shock (B) | 1 2.92. 
Subjects (C) | 9 4.67 
AXB 1 | 0.04 
AXC ç 5.31 
B XC $- 4.52* 
AXBXC |/ Astrener 

— 1. = 
* p «.05. 
E i 


a similar finding, Half the subjects 
Were trained with a weak UCS (air 
puff) and half with a strong UCS, and 
if this noxious UCS is thought of as a 
Source of drive then her experiment 
also allowed a test of the relationship 
between stimulus intensity and drive 
Strength. Walker found significant 
differences in acquisition due to both 
CS intensity and UCS intensity, but 
there was no interaction betwee? 
these variables, This result cannot 
be taken as clear-cut, for the superior 
reinforcement of a strong UCS 
may have produced superior learning 
(Spence et al., 1958), as the strong 
'S may also have done, If either of 
these latter factors had produced 
differentia] habit strengths then 
a multiplicative relationship should 
have been obtained; the reason for 
this expectation emerges from a com- 
Parison with discrimination learning 
which follows shortly, 


TABLE 5 
MEAN GSR TO Tone (micromhos) 
IN Experiment IV 


^ ——— 
| Shock strength 
Tone strength CMM — 
Strong Weak 
Strong 251 | 2.08 
Weak 0.98 0.63 
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At least two other studies have pro- 
duced evidence for Hull's formulation 
and some reconciliation must be made 
of these findings with the present data. 
First, with reference to an experiment 
on eyelid conditioning, Spence (1956) 
has reported a significant interaction 
between CS strength and anxiety level 
at asymptotic values of response 
strength. Second, Castaneda (1956) 
found the same type of relationship to 
hold for the latency and amplitude of 
a manual response, again with mani- 
fest anxiety as the drive variable. 
A common property of these two 
Studies was the use of anxiety as a 
means of manipulating drive strength 
and the particular form of the data 
might be explained in terms of the 
"reactive" hypothesis of anxiety, i.e., 
with the idea that the difference be- 
tween high and low anxious subjects 
from the motivational point of view is 
only apparent when they are pre- 
sented with a ‘‘strong”’ eliciting stim- 
ulus or, in terms of the Perkins-Logan 
theory, with a stimulus which evokes 
little generalized inhibition. Accord- 
ing to this view the multiplicative 
effect is obtained with anxiety as a 
drive variable because the effective D 
for the high anxious subjects is 
greater with a strong than with a 
weak eliciting stimulus. In the pres- 
ent experiments, on the other hand, 
the after effects of shock (D) are the 
same for strong and weak eliciting 
stimuli (tones), the reactive hypothe- 
sis does not apply, and the relationship 
under study takes the simpler additive 
form. It should also be noted that 
the experiment quoted by Spence was 
also complicated by the possibility 
that true learning was superior with 


the stronger CS. 
It may be sug 
tionale of Experiments 
invalidated by the use of the GSR. 
In Experiment I it was found that 
effects predicted by the differential 


gested that the ra- 
III and IV was 
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conditioning theory could be obtained 
with the GSR, so there can be no 
objection to its long latency as such, 
but a further difficulty may be seen 
in the fact that the GSR has often 
been used as a measure of emotion. 
Spence (1956) has assumed that an 
emotional response of an implicit 
nature (re) underlies the effects of aver- 
sive motivation (D) as generated by 
shocks, and the GSR might be taken 
as reflecting the strength of this emo- 
tional activity. If this argument be 
extended to allow that any stimulus, 
including a tone, arouses some emo- 
tional activity, then the use of the 
GSR in these experiments on the 
combined action of the tone and the 
aftereffects of shock might simply 
reflect the summation of two forms of 
emotional activity. There are at 
least two answers to this objection. 
In the first place, no theorist has 
supposed that a weak tone is motivat- 
ing and it should follow that the 
granting of motivational properties to 
a strong tone only leads to the ex- 
pectation of a multiplicative relation- 
ship between tone strength and shock 
strength. Second, it is likely that the 
latency of the implicit emotional or 
fear response is too long to affect even 
the relatively long-latency GSR. For 
the present purposes, therefore, it 
seems appropriate to treat the GSR 
as any other response within the 
framework of the differential condi- 
tioning theory, but to capitalize on its 
long latency in the manner indicated 
earlier. 

Another point which must be con- 
sidered is the methodological criticism 
recently levelled by Miller (1959) at 
the present type of study, which seeks 
the form of a relationship by simply 
taking representative values instead 
of covering the entire range of both 
variables. Miller's point is that the 
form of the relationship may vary 
with the location of the sampled 
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values within the entire range, and 
this is true if one of the variables is 
the number of reinforced trials (JV), 
which causes response strength to 
increase in the form of a negative 
growth function or some kind of non- 
linear progression. In the present 
instance, however, both D and I are 
assumed to bear a linear relationship 
to shock strength and tone strength, 
respectively, and it should not matter 
what values are chosen for experi- 
mentation within limits set by the 
introduction of physical damage fac- 
tors with extremely strong shocks and 
loud tones. Even if a negatively 
accelerated relationship were assumed 
to hold between J and tone strength, 
as is more likely, then the deduction 
of an additive effect made above 
should still be valid, for the differ- 
ences in E values for weak and strong 
Shock at various tone strengths are 
not affected by the nature of this 
relationship and should Stay constant 
(Figure 5). Quite apart from this 
argument it is felt that the values 
used in the present experiments cov- 
ered the range to a reasonable degree; 
as Table 2 shows, the weak tone-weak 
shock combination was so weak as to 
produce a barely measurable response 
in the male subject on most trials, the 
mean value being .07 micromhos, 
whereas the strong tone-medium shock 
combination produced large responses 
with a mean of 4.79 micromhos in the 
female subject. : 

A second methodological factor 
which might affect the validity of any 
attempt to determine the nature of 
the joint effect of two variables in 3 
theoretical system is the unit o 
measurement to be used for the de- 
pendent variable. The iar ge 
from the Perkins-Logan theory oi an 
additive relationship between ea 
ulus strength and drive m 
(independent variables) as nee a 
terminers of net excitatory poter 
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(intervening variable) was tested 
through the amplitude of the GSR 
(dependent variable). The link be- 
tween net excitatory potential (£) and 
amplitude of GSR thus becomes cru- - 
cial if the additive effect deduced to 
apply to the former variable is to be 
reflected in the latter variable. It is 
fortunate, therefore, that evidence of 
a simple linear relationship between 
these two variables is available 
(Spence, 1956, pp. 113-116). The 
nature of the scales for quantifying H, 
D, and / need not enter into the 
question, for the expression in these 
terms distinguishing response strength 
at the two drive levels is identical for 
weak and strong eliciting stimuli. 
While amplitude of response may 
therefore be used with confidence asa 
measure in the test of the deduction 
from the differential conditioning 
theory, other measures of response 
strength may not be equally appro- 
priate. The present state of behavior 
theory leads to the deduction that 
frequency of response is a norma 
ogival function of superthreshold ex- 
citatory potential, in which case the 
evidence of Spence (1956) and Walker 
(1960) with frequency of conditioned 
response (CR) as a measure of ^ 
may be suspect. Latency of response 
should theoretically bear the same 
relationship to E as amplitude as & 
measure, so that the data of Castaneda 
(1956) which were also cited above ate 
less open to this criticism. 

Further comparisons may be made 
with discrimination learning. In de 
riving the prediction of an additive 
relationship from the Perkins-Log@” 
theory, usé was made of the fof 
mulation 


R=f(E) =f(Hx D-/). 


It will be noted that D is assumed 
only to multiply H and not J. » 

is a long-standing. assumption, pa n 
by both Hull and Spence, and ! 
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supported-by a wide variety of data. 
In the usual discrimination learning 
situation the learner is presented with 
two cue stimuli on each trial, with 
the response to one cue reinforced and 
to the other not reinforced. In these 
circumstances the positive cue should 
acquire habit strength (H) which 
generalizes to the negative cue (/7), 
while the negative cue acquires in- 
hibitory potential (Z) which gen- 
eralizes to the positive cue (J). If 
some uniform background is present 
throughout the experiment, and this 
background stimulus lies on the same 
dimension as the positive and negative 
stimuli, then it too will acquire in- 
hibitory properties (Jz) which gen- 
eralize to the positive stimulus (s+) 
and to the negative stimulus (is). 
The application of the formula to give 
the net excitatory potential of the 
positive stimulus (E+) and the nega- 
tive stimulus (E—) then gives the 
following expressions: 


EJ---HxD-(-4Ia) 
and 
E— — H x D — (I 4 In) 


when it follows that, if the two E 
values under high drive (D+) be 
represented as E+-+ and E— +, and 
Due low drive (D—) as E+ — and 


E++ — E-+ =D+(H—#H)-k 
and 
E+- —E-- =D-(H-#)-—k 


where is a constant representing the 
total of the four sources of inhibition. 
Now D+ is greater than D—, so it 
has been deduced that discrimination 
learning under the stipulated condi- 
tions should be superior with high 
drive, The other derivation analo- 
gous with that made for the stimulus 


— intensity situation requires a compari- 
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son of the E values for the positive 
stimulus with high and low drive 
(E++ — E+-—) and the negative 
stimulus with high and low drive 
(E—-- — E— —), as follows: 


E++ — E+- = H(D+ — D—) 
and 
E-+ —E-— =H(D+ — D—) 


and since Z7 is greater than H, because 
the latter has generalized from the 
former, it is to be expected that there 
will be interaction between the degree 
of separation of the stimulus cues on 
the one hand and drive level during 
training, on the other hand. Refer- 
ence back to the deductions about the 
joint effects of stimulus intensity and 
drive strength which were tested in 
Experiments III and IV will show 
that the assumption of constant habit 
strength is there crucial, for any 
difference in this respect between 
strong and weak eliciting stimuli will 
lead to a multiplicative rather than 
an additive relationship with drive 
strength. Of equal importance is the 
limitation of drive effects to habit 
strength and the supposition that 
drive does not multiply inhibition; 
modification of the basic formula to 
read 


R=f(E) -fDXH-—DX1) 


would also lead to the prediction of a 
multiplicative effect. 

In a recent review of drive effects in 
spatial discrimination learning, Brown 
(1961) has reached a conclusion which 
tallies exactly with the deductions 
made above, i.e., that when the habit 
strengths of the response tendencies 
for the positive and negative cues are 
held equal, drive differences do not 
affect percentage choice of the correct 
response; if a correction method of 
learning is used, or if the two responses 
are reinforced with differential fre- 
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quencies, then drive effects become 
apparent. Brown notes, however, 
that there are too few relevant data to 
allow any such firm conclusion in 
the case of nonspatial discrimination 
learning. If anything in the latter 
case, performance improves with in- 
creased motivation, and this may be 
due to the greater difficulty in keeping 
habit strengths equal in the non- 
spatial situation with forced trials. 
It should also be noted that many of 
the studies cited by Brown involve 
instrumental approach learning, and 
that under these conditions the mere 
occurrence of the response, with or 
without reward, is assumed to produce 
an increment of habit strength; for 
this reason, forced runs to the non- 
rewarded side of a T maze, for ex- 
ample, are thought to produce as 
much learning as a free run to the 
rewarded side, and the increasing 
preference for the rewarded side is 
explained in terms of incentive moti- 
vation (Spence, 1956). It is therefore 
as well to emphasize the earlier caution 
that the analogy between the mechan- 
ism of the Perkins-Logan theory and 
discrimination learning is inexact, and 
that stimulus-intensity effects are 
more safely treated as due to differ- 
ential classical conditioning, 
Before concluding this discussion of 
the interrelated effects of stimulus 
intensity and drive strength, attention 
must be drawn to some recent com- 
ments by the author on drive effects 
in conflict (Champion, 1961). In 
order to attempt a resolution of an 
apparent inconsistency in the S-R 
treatment of conflict, it was assumed 
that changes in the strength of sep- 
arate drives, such as hunger and fear, 
not only produce a common result by 
contributing to overall drive level, 
according to the generalized 
hypothesis, but also lead to a 
guishable effects of a ps mre 
through the dynamism ot the 
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stimuli. Thus it was supposed that 
an increase in the hunger of a rat m 
approach-avoidance conflict affects 
the generalized habit strength re- 
sponsible for the gradients of both 
approach and avoidance (generalized 
drive) and also acts selectively on 
the approach gradient through the 
strength of the hunger-drive stimulus. 
Although the evidence presented in 
the present account of stimulus- 
intensity effects would seem to force 
one to discard the assumption of a 
stimulus-dynamism principle, the facts 
of stimulus-intensity changes remain 
unchallenged and it is still possible to 
hypothesize that a stronger drive 
stimulus elicits a stronger response, 
other things being equal. As for the 
explanation of such a process, the 
substitution of the dimension of drive 
strength for that of the intensity of 
the eliciting external stimulus may 
pose further problems. It would have 
to be assumed that the response under 
study is not reinforced when the or- 
ganism is satiated, so that satiation 
stimuli take on inhibitory properties. 
This latter inference is similar to some 
recent proposals by Estes (1958) 
More generally, however, the present 
line of argument preserves the purely 
motivational or energizing property 9 
drives, while allowing them stimulus 
properties as well, whereas Estes 
appears to be moving towards. i: 
purely associative explanation of drive 
effects. 

An experimental test of these 
alternatives might be conducted along 
the following lines. Suppose that ê 
group of rats was trained to perfor 
some act such as lever pressing at 
hours food deprivation; the us¢ “i 
some reinforcement other than for : 
would make this learning technica A 
easier to obtain. Suppose also i 
the same animals were regulan 
placed in the apparatus at 12 ho 6 
deprivation but that the response 
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lever pressing was never reinforced at 
this level. Considering only the role 
of the drive stimuli and assuming that 
the effect of drive is purely associa- 
tive, the application of the Perkins- 
Logan theory to this form of differ- 
ential training is set out in Figure 7. 
This figure differs from preceding 
representations in that habit strength 
rather than gross excitatory potential 
is plotted on the ordinate, along with 
inhibition. The lever pressing re- 
sponse is reinforced at 4 hours 
deprivation, and 6 units of habit 
strength are arbitrarily assigned to the 
drive stimuli at that level (Z7 = 6); 
because of drive stimulus generaliza- 
tion, 5 and 4 units of generalized habit 
strength (/7) are assigned to the drive 
stimuli at 8 and 12 hours deprivation, 
respectively. The response is also 
allowed to occur at 12 hours depriva- 
tion, but it is never reinforced at that 
level, so that enough inhibition (Z) 
accumulates to cancel the generalized 
habit strength, viz., 4 units; through 
drive stimulus generalization again, 
3 and 2 units of generalized inhibition 
(I) are assigned to the drive stimuli 
at 8 and 4 hours deprivation, re- 
spectively. Two methods of calculat- 
ing net excitatory potential are then 


HABIT STRENGTH OR INHIBITION 


o n 8 12, 
HOURS OF DEPRIVATION 


Fic. 7. The application of the Perkins- 
Logan theory to a hypothetical situation in 
which a given response is reinforced at 4 hours 
deprivation of food but not reinforced at 12 
hours deprivation. (It is assumed in this 
instance that the effects of drive are purely 
Associative.) 


TABLE 6 


ALTERNATIVE DERIVATIONS OF NET 
EXCITATORY POTENTIAL FROM 
VALUES ARBITRARILY 
ASSIGNED IN 


FIGURE 7 
Hours of deprivation 
4 8 12 
Habit strength (Hor H)| 6 5 4 
Drive strength (D) 2 4 6 
HXD 12 20 24 
fE=H-T 
T equals 2 3 4 
E equals 4 2 0 
fE=HXD-I 
T equals 12 18 24 
E equals 0 2 0 


compared in Table 6. If a purely 
associative theory of drive is used 
(Figure 7), then net excitatory poten- 
tial (E) should decrease with depriva- 
tion time, as shown in Table 6 when 
E = H-—TI. If, on the other hand, 
deprivation brings generalized ener- 
gizing effects (D) as well as drive 
stimuli (Sp), and if E = H x D — I, 
then more inhibition will have to be 
generated to stop the response occur- 
ring at 12 hours deprivation, and net 
excitatory potential should reach a 
maximim at some period of depriva- 
tion between 4 and 12 hours (Table 6). 


CONCLUSIONS 


Taken collectively, the results of 
the present experiments confirm the 
Perkins-Logan theory of stimulus- 
intensity effects in terms of differential 
conditioning and are incompatible 
both with Hull's original formulation 
and the modification proposed by the 
author. Thus the feature of the 
stimulus situation which seems to be 
directly responsible for the phenomena 
under study is the relative disposition 
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of the eliciting stimulus and the back- 
ground or intertrial stimuli on some 
common dimension of generalization. 
The Perkins-Logan theory is also 
meaningful in the sense that it permits 
the generation of a number of predic- 
tions which remain to be tested. The 
denial of Hull's stimulus-intensity 
dynamism postulate and the verifica- 
tion of the differential conditioning 
theory, however, does not jeopardize 
the general principles of the type of 
behavior theory which he and his 
co-workers established. On the con- 
trary, the differential conditioning 
theory follows directly from some 
general features of Hull’s behavior 
system, based as it is on the accepted 
principles of excitation following rein- 
forcement, inhibition following nonre- 
inforcement, stimulus generalization, 
and the algebraic summation of 
excitation and inhibition. What Per- 
kins and Logan have done is imply 
that an assumption of stimulus-inten- 
sity dynamism is redundant within 
the general framework of Hull's 
behavior theory, and the empirical 
data are in keeping with this implica- 
tion. The third and fourth experi- 
ments described in this paper went 
one step further, however, in showing 
Hull’s assumption of a multiplicative 
relationship between stimulus inten- 
sity and drive strength to be invalid 
as well as redundant. 


SUMMARY 


Following an outline of the Hullian 
and Perkins-Logan theories of stim- 
ulus-intensity effects, and a distinc- 
tion between the role of these effects 
in learning and performance, four 
experiments were described which 
tested the relative merits of the two 
theories in situations limited to effects 
on performance. In the Bret d 

nt it was found that, with stimulus 
ine et as a CS, response strength late 
offset as « 
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in training was directly related to the 
strength of the offset stimulus. Be- 
cause this finding could be readily 
accounted for by a modification. of 
Hull's stimulus-intensity dynamism 
postulate couched in terms of degree 
of stimulus change as the effective 
variable, regardless of the direction of 
the change, the second experiment 
was designed to detect any negative 
transfer occurring with a shift from 
onset to offset or vice versa in the one 
group of subjects. Negative transfer 
did occur and this outcome was taken 
to be incompatible with the modifica- 
tion of Hull's postulate and as further 
evidence for the Perkins-Logan theory 
in terms of differential conditioning. 
In the third and fourth experiments 
tests were made of a deduction from 
the latter theory to the effect that 
there should be an additive relation- 
ship between stimulus strength and 
drive strength in their joint determ!- 
nation of response strength, contrary 
to Hull's assumption of a multiplica- 
tive relationship. The two studies 
provided evidence for an additive 
relationship. It was concluded from 
the series of experiments as a whole; 
that the Perkins-Logan theory is tO 
be preferred to Hull's treatment 
of stimulus-intensity effects. Some 
other implications of the former 
theory were also considered, mainly 
through comparisons with discrimina- 
tion learning. 
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CONDITIONING THROUGH VICARIOUS INSTIGATION 


SEYMOUR M. BERGER? 


Indiana University 


Personality theorists and social psy- 
chologists generally recognize that the 
emotional responses of one person 
(performer) may elicit emotional re- 
sponses from another (observer). 
When these emotional responses are 
similar, the relationship between the 
performer and observer is described as 
empathetic, or one of identification. If 
the relationship is one of envy, the per- 
former’s pleasure elicits an unpleasant 
emotional reaction from the observer. 
In a sadistic (malicious joy) relation- 
ship, the performer's displeasure elicits 
a joyful response from the observer, 
In all of these relationships, the ob- 
server’s emotional response is depend- 
ent upon the emotional response of the 
performer. As social phenomena, 
empathy, envy, and sadism require that 
the observer experience the emotion of 
the performer vicariously, in the sense 
of perceiving it, as a prelude to the 
observer's own emotional response. 

The purpose of this paper is to pro- 
vide a conceptual framework within 
which to study empathy, envy, and 
sadism as determinants of interpersonal 
behavior. For example, empathy may 
account for certain forms of altruism, 
envy for the loss of friendship, and 
sadism for malicious mischief. Before 
proceeding to these problems, however, 
it is necessary to identify a basic mech- 
anism in the arousal of the observer's 


emotional response. 
Vicarious INSTIGATION 


If an observer responds emotionally 
to a performer's unconditioned emo- 


1 The author wishes to express his iidebt- 
edness to at least two colleges yaam 
Levine and Leon H. Levy, for their a 
couragement, many helpful suggestions, 
critical reading of the manuscript. 


tional response (UER) then vicarious 
instigation has occurred.? The per- 
former's UER requires more precise 
definition. It is meant to refer to the 
performer's emotional state, following 
the presentation of an unconditioned 
stimulus (UCS), as perceived by the 
observer. The actual state of the per- 
former is not directly observable. The 
observer must infer the performer's 
UER from observable events in the 
environment. Typically, the observer 
will rely upon the UCS and the per- 
former's overt unconditioned response 
(UCR) as cues to the performer's 
UER. Consequently, the performer's 
UCS and UCR are important deter- 
mining factors in vicarious instigation. 

The performer's UER need not ac- 
tually occur, the observer may infer it 
(incorrectly) from the situational con- 
text. It is only necessary that the ob- 
Server perceive that the performer's 
UER has occurred, and respond emo- 
tionally, for vicarious instigation. A 
mother, for example, may react with 
fear and anguish when she sees her 
child fall, only to find that the child 
is unhurt and undisturbed. In this ex- 
ample, the mother’s emotional response 
(ER) was dependent upon her per- 
ception of what had happened to her 
child (the UCS) rather than the child's 
reaction (UCR). She misperceived 
the child’s emotional state in this case, 
yet she was vicariously instigated ; she 
inferred the child's UER from the 
. * Throughout this paper, the term t1condi- 
tioned refers to dependable S-R relationships 
Which serve as a basis for conditioning. This 
usage is in agreement with the definition em- 
ployed by many writers in the area of condi- 
tioning (eg, Hilgard & Marquis, 1961, pp. 
46-47). In this procedural sense, the term 
unconditioned may refer to either learned or 
unlearned S-R relationships. 
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UCS only. Had she been in another 
room and heard the child cry (UCR), 
the mother might have inferred that 
her child was in pain (UER) without 
knowledge of the precipitating event 
(UCS), and yet would have been vicar- 
iously instigated. While the perform- 
er's UCS and UCR are important cues 
for vicarious instigation, both are not 
always necessary for vicarious insti- 
gation. 


Pseudovicarious Instigation 


The simple contiguity between the 
performer’s UCS and UCR and the 
observer’s ER is not sufficient evidence 
for vicarious instigation, however. 
Since the observer's ER follows the 
performers UCS and UCR, the ob- 
server may appear to be responding to 
the performer’s UER when in actuality 
he is responding to some other stimulus 
component in the situation. As indi- 
cated above, vicarious instigation is de- 
pendent upon the performer’s UCR or 
the UCS. In some situations, however, 
the observer may respond to the UCS 
only, so that the performer's UER is 
superfluous, An observer may respond 
with fear to the firing of a gun, regard- 
less of whether the target is a bull’s-eye 
or another human being. The observ- 
er’s fear is not vicariously instigated in 
this instance—although it may appear 
to be if the target happens to be another 
human being. 

Another possible source of pseudovi- 
carious instigation is the performer’s 
UCR. The performer’s UCR may bea 
UCS which elicits an observer ER; in 
this case, the observer responds to the 
performer’s UCR only, so that the per- 
former’s UCS and UER are superflu- 
ous. A sudden loud scream by a per- 
former, for example, may elicit a fear 
response from the observer independ- 
ently of his response to the performer’s 
UCS or UER. Recent studies by 
Church (1959) and Murphy, Miller, 
and Mirsky (1955) have demonstrated 
how a rat’s squeal and a monkey’s pres- 
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ence, respectively, may serve as a con- 
ditioned fear stimulus. 

The performer's UER may serve as 
a discriminative stimulus which indi- 
cates to the observer that he will re- 
ceive the UCS that is presented to the 
performer. This is another source of 
pseudovicarious instigation. One child 
may fear that he will be spanked when 
he sees that his brother is being 
spanked for no apparent reason. It is 
not his brother’s cries in response to the 
spanking that elicit the child's fear, 
rather it is the anticipation of being 
spanked himself, 

The following study by Barnett and 
Benedetti (1960) illustrates a form of 
pseudovicarious instigation which may 
occur in conditioning. In this case, an 
extraneous stimulus in the situation 
served as a UCS: 

After adapting the GSR of an ob- 
server to a buzzer, the authors had him 
watch a performer undergoing GSR 
conditioning, with shock as the UCS 
and the buzzer as the conditioned stim- 
ulus (CS). Even though most observ- 
ers were not aware that the performer 
was being shocked, they responded 
with GSRs to the buzzer on test trials. 
Subsequent experimentation concern- 
ing the various stimulus components of 
the situation led the authors to con- 
clude that what appeared to be “vicari- 
ous conditioning" (their term) in- 
volved responses to visible needle de- 
flections of the galvanometer, even in 
the absence of the performer; the 
GSRs, in this case, reflecting changes 
in activation level following the occur- 
rence of a stimulus change, i.e., needle 
movements. 

All of these situations are not in- 
stances of vicarious instigation, al- 
though they would appear to be in 
casual observation. Obviously, there- 
fore, anecdotal evidence for vicarious 
instigation is of dubious value, and an 
experimental analysis is required to 
eliminate the possibility of pseudovi- 
carious instigation. 
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Concordant and Discordant Forms? 


The definition of vicarious instiga- 
tion requires that the observer respond 
emotionally to the performer’s UER, 
but there is no restriction placed upon 
the nature of these emotional responses. 
As indicated previously, there may be 
a certain similarity or dissimilarity be- 
tween the emotional responses of the 
performer and observer. Concordant 
vicarious instigation takes place when 
performer and observer responses are 
similar *; discordant vicarious instiga- 
tion occurs when performer and ob- 
Server responses are dissimilar. An 
examination of concordant and discord- 
ant forms of vicarious instigation pro- 
vides a rudimentary classification Sys- 
tem for empathy, envy, and sadism. 

If discussion is limited to the elicita- 
tion of emotional responses which may 
be classified as either pleasant (+) or 
unpleasant (—), it is possible to specify 
the necessary conditions for empathy, 
envy, and sadism. The four possible 


? Although not specifically concerned with 
the problem of conditioning, Heider (1958, 
Ch. 11) has presented an analysis which de- 
scribes one person's reactions *to the lot of 
the other.” There are some parallels between 
the analysis of conditioning through vicarious 
instigation and Heider's Presentation. The 
present analysis attempts to deal with similar 
problems, but in terms of conditioning 
models. 

* Some authors (e.g. Asch, 1932, p. 171; 
Heider, 1958, pp. 279-282) distinguish be- 
tween two forms of concordant emotional re- 
sponse. The observer may react concordantly 
with the performer's response to the uncon- 
ditioned stimulus (UCS) and in a manner 
that is concordant with the way that the 
observer would react if he had been directly 

instigated; the observer may react GE 
antly with the performer s response to the 
UCS, but in a manner that is Poor rcs E 
the way the observer would emp e ad 
been directly instigated. In the pn in 
stance, the observer is reacting UT heti- 
cally, while in the latter instance t E cer id 
is reacting sympathetically. This E m 
while useful for some purposes, pe e a 
regarded in this paper because it does 
advance the present analysis. 


TABLE 1 


CoMBINATIONS OF EMOTIONAL RESPONSES 
FOR PERFORMER AND OBSERVER 


Case Performer's UER Observer's ER 
1 | TE + 
II + = 
ll m 
IV | " E 


combinations of pleasant and unpleas- 
ant emotional responses for performer 
and observer are presented in Table 1. 
It should be noted that throughout this 
discussion, the classification of the per- 
former’s UER is in terms of the ob- 
server’s perception of it. 

Empathy may be defined by Cases I 
and IV; shocking the performer (—) 
may be painful (—) for the observer, 
for example. Envy may be defined by 
Case II; rewarding the performer (+) 
may instigate a feeling of depression in 
the observer (—). Sadism may be de- 
fined by Case III; Shocking the per- 
former (—) may 
observer (4-). 

Although these combinations of per- 
former and observer responses seem to 
identify the necessary conditions for 
characterizing empathy, envy, and sad- 
ism, the present scheme does not in- 
clude the sufficient conditions for com- 
plete characterization, 

Nevertheless, this initial effort sug- 
gests a useful direction to pursue in 
order to bring a wider variety of emo- 


tional responses within the scope of 
conditioning theory. 


Conditioning 


As a motivational concept, vicarious 
instigation has ‘relevance to a variety 
of social behaviors acquired through 
classical and instrumental conditioning. 
In the discussion which follows, an at- 
tempt is made to identify two condi- 
tioning models in which social learning 
is a function of vicarious instigation. 


elicit joy in the: 


m 


so 
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CLASSICAL CONDITIONING PARADIGM: 
THe PERFORMERS UER as a UCS 


In classical conditioning, in order for 
a response to become conditioned to a 
neutral stimulus (CS), the response 
must be elicited by the UCS. Any 
stimulus may serve as the UCS if it 
will elicit a given response with some 
degree of regularity. In vicarious in- 
stigation, the performer's UER serves 
as a UCS for the observer’s ER. 

The distinction between vicarious in- 
stigation and conditioning may be il- 
lustrated as follows: Haner and Whit- 
ney (1960) had an observer watch a 
performer who supposedly was being 
shocked each time a light was pre- 
sented. The authors found that this 
situation elicited GSRs from the ob- 
servers, the GSR size being positively 
related to the general anxiety level of 
the observer. But no test was made to 
determine whether the observer’s GSRs 
became conditioned to the light alone. 
While Haner and Whitney call this 
phenomenon “empathic conditioning,” 
the term vicarious instigation seems 
more appropriate. 

In the classical conditioning model, 
vicarious instigation must occur in con- 
tiguity with a CS. Conditioning occurs 
when the CS comes to elicit the ob- 
server’s ER in the absence of the per- 
former’s UER. While this model may 
be applied to a variety of social learning 
situations, an example of concordant vi- 
carious instigation in the formation of 
attitudes may be found in this excerpt 
from Gordon W. Allport (1935) : 


Even before he has an adequate background 
of appropriate experience, a child may form 
many intense and lasting attitudes toward 
races and professions, toward religion and 
marriage, toward foreigners and servants, 
and toward morality and sin. A parent's 
tone of voice in disapproving of ragamuflins 
who live along the railroad track is enough 
to produce an uncritical attitude in the child 
who has no basis in his experience for the 
rational adoption of the parent's point of view 


[italics added]. It frequently happens that 
subsequent experience is fitted into the at- 
titude thus uncritically adopted, not . . 
made the basis for the attitude. In such 
cases every contact is prejudged, contradic- 
tory evidence is not admitted, and the attitude 
which is borrowed secondhand [italics added] 
is triumphant (p. 811). 


In terms of the model, the “raga- 
muffins" are the CS, the parent's UER 
is inferred from the disapproving tone 
of voice, and the child's uncritical nega- 
tive attitude is the observer's ER. 
Once the emotional component of the 
attitude is conditioned (“borrowed 
secondhand”), the cognitive elements 
are added by subsequent experience. 

Although attitudes are formed in a 
variety of ways, the present model pro- 
vides a conceptual and experimental 
basis for studying the transmission of 
ready-made attitudes through observa- 
tional learning in a classical condition- 
ing framework, One can only specu- 
late concerning the variety of social 
experiences that may be transmitted in 
this way. 


INSTRUMENTAL CONDITIONING PARA- 
DIGM: ViCARIOUS INSTIGATION 
AS A REINFORCEMENT 


Vicarious instigation may serve as 
either a positive or negative reinforce- 
ment for observer responses in instru- 
mental conditioning situations. Each 
type of reinforcement will be con- 
sidered separately. 

As a positive reinforcement, vicari- 
ous instigation strengthens the ob- 
server responses it follows. It is as- 
sumed that a positive ER is instigated 
in the observer. Therefore, pleasant 
empathetic feelings or malicious joy 
would be the basis for the positive rein- 
forcement. Altruism, for example, is 
assumed to be based upon concordant 
vicarious instigation where both per- 
former and observer emotional re- 
sponses are positive. To illustrate, I 
may become more charitable when my 
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gift is received with obvious pleasure 
by the recipient. If perceiving the 
pleasure of the recipient is vicariously 
instigating, it may reinforce my chari- 
table behavior. Of course, sources of 
pseudovicarious instigation would have 
to be controlled, since charitable be- 
havior may be reinforced by approval 
I receive from the recipient or from my 
family and friends. 

""Tattletale" behavior is an example 
of how a sadistic relationship may af- 
fect conditioning. One child may 
learn that he can have his rival pun- 
ished by disclosing the rival's misdeeds 
to his mother. Presumably, the enjoy- 
ment elicited by this rival's punishment 
reinforces the  tattletale behavior. 
Again, experimental analysis is neces- 
sary to eliminate pseudoeffects ; eg., 
the child may be seeking the approval of 
the mother and a raise in his own 
status. 

Vicarious instigation may function as 
a negative reinforcement if responses 
are strengthened when followed by the 
removal or reduction of vicarious in- 
stigation. If it is assumed that emo- 
tional states have motivating properties 
then, following Miller's (1948) para- 
digm, vicariously instigated emotions 
may function to initiate learning; fur- 
thermore, the observer will acquire re- 
sponses which reduce the emotional 
drive stimulus. 

The performer’s UER may instigate 
the observer’s ER (assumed to be neg- 
ative). The observer’s ER is the 
drive stimulus which instigates other 
responses. If one of these other re- 
sponses is followed by the removal of 
the performer's UER, vicarious insti- 
gation is terminated, and so is the ob- 
server's ER. The response which was 
followed by the termination of vicarious 

instigation will be strengthened. : 

This model seems applicable to situ- 
hen the vicariously instigated 


ations W 1 
is negative. Consequently, 
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both unpleasant empathetic feelings and 
feelings of envy can function as drive 
stimuli in this model. Consider, for ex- 
ample, the starving beggar who comes 
to the door for a handout. The house- 
wife may feel grief at the sight of the 
emaciated beggar and upon hearing his 
plea. She may be able to reduce her 
grief by shutting the door to remove 
the pitiful figure from her view. In 
this case, the beggar’s state (perform- 
er’s UER) aroused the housewife’s 
grief (observer's ER) which was re- 
duced by shutting the door. Presum- 
ably, the response of shutting the door 
was effective in reducing the feeling 
of grief and was reinforced. There are, 
of course, other potential sources of re- 
inforcement in this type of situation. 
For example, the housewife may feel 
threatened by the presence of a stranger 
at her door, and may shut the door to 
insure her own safety. Clearly, 
Sources of pseudoeffects must be ruled 
out before it can be concluded that the 
behavior was a function of vicarious 
instigation. 

Envy may also function as a drive 
stimulus in this model of conditioning. 
One man may be envious when his 
rival is promoted, and attempt to re- 
duce his feelings of mortification by 
subsequently avoiding all encounters 
with his rival. Again, it should be 
noted that an analysis for pseudovicari- 
ous instigation must be made (e.g., the 
man may be responding to his loss of 
the promotion rather than his rival’s 
good fortune) before concluding that 
vicarious instigation is the basis for the 
behavior. 

A number of writers (e.g., Berger, 
1961; Campbell, 1961; Hill, 1960; 
Sechrest, 1961) have used the term 
vicarious reinforcement to describe 
learning phenomena. The present 
model identifies vicarious instigation as 
a possible basis for vicarious rein- 
forcement. 


ee 
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AN EXPERIMENTAL DEMONSTRATION 
or Vicarious INSTIGATION IN 
CLASSICAL CONDITIONING * 


The foregoing analysis has employed 
anecdotal evidence and hypothetical ex- 
amples to illustrate and support the 
conceptual framework. In light of the 
discussion of pseudovicarious instiga- 
tion, an experimental analysis is im- 
perative, yet lacking. Since vicarious 
instigation is crucial to the framework, 
demonstrations seemed in order. Asa 
starting point, the author selected the 
classical conditioning model as a basis 
for demonstrating vicarious instigation 
in learning. 

Although developed independently, 
the general setting for the experiments 
to be reported is similar to the situa- 
tions used by Haner and Whitney 
(1960) and Barnett and Benedetti 
(1960). In this situation, an observer 
watches a performer (actually a con- 
federate of the experimenter, who sup- 
posedly is being shocked every time a 
light dims. The dimming of the light 
is preceded by a buzzer. The GSRs of 
the observer are recorded as a measure 
of his ER. Everytime the light dims 
and the performer is supposedly 
shocked, he jerks his arm from the in- 
ductorium.® The buzzer was the CS. 
The performers UER was inferred 
from the dimming of the light and elec- 
tric shock (performer's UCS) and the 
performer’s arm movement (perform- 
ers UCR). Presumably, if the ob- 


5 The author wishes to express his appre- 
ciation to Indiana University for an equip- 
ment grant in support of this research, and to 
Neil Schneiderman, Susan Kroener, and Pa- 
tricia Stewart who were the performers and 
scorers in these studies. The experimenter 
for Experiment I was the author, Patricia 
Stewart was the experimenter in Experiment 
II, Susan Kroener was the experimenter in 
Experiment III. : A 

8 The performer jerked his arm rapidly, as 
a twitch. The performer tried not to make 
other movements, or change facial expression. 


server responded with a GSR on condi- 
tioning trials, vicarious instigation oc- 
curred. It was necessary, of course, to 
introduce pseudovicarious instigation 
controls, 

The previous discussion of pseudovi- 
carious instigation suggested three gen- 
eral sources of pseudovicarious instiga- 
tion: The performer's response, per se; 
the performer's UCS, per se; and the 
observer's anticipation that the UCS 
will be presented to him. 


EXPERIMENT I 


In an initial experiment, an attempt 
was made to obtain information con- 
cerning the effectiveness of the experi- 
mental situation for vicarious instiga- 
tion, and to determine if the instigation 
of observer GSRs by apparent shock 
to the performer was due to the per- 
former’s response, per se. The experi- 
ment contained two conditions: 

1. Shock- Movement (S-M)—the ob- 
server was told that the performer 
would be shocked and the performer 
reacted with an arm movement when 
apparently being shocked. 

2. No Shock-Movement (NS-M)— 
the observer was told that the per- 
former would make a voluntary arm 
movement at a given signal (light dim- 
ming), and that the performer was not 
being shocked. 

If no differences were found between 
these conditions, then it would appear 
that observers were responding solely 
to the performer's movement, and that 
the performers UER was not a de- 
termining factor. If differences were 
found, where S-M observers were 
more responsive, then some support 
would be provided for the phenomenon 
of vicarious instigation. 

Fifteen subjects were assigned at random 
to each condition. The performer (a male 
confederate of the experimenter who ap- 


peared to be just another subject recruited 
for the experiment) was the same for all 
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subjects. In order to avoid any systematic 
bias in the performer's behavior, the instruc- 
tions regarding the different conditions were 
read to the observer before the performer ap- 
peared for the experiment. For observers in 
the S-M condition, the experimenter ex- 
plained that another subject would be shocked 
in the last part of the experiment, and that 
he would be a few minutes late because the 
experimenter's assistant was testing the other 
subject to see what level of shock he could 
take; for observers in the NS-M condition, 
the performer’s initial absence was explained 
on the basis that the experimenter's assistant 
was giving the other subject practice in mak- 
ing the arm movement. Instructions, which 
were common to both conditions and did not 
reveal the specific nature of the condition, 
were read aloud when performer and ob- 
server were present together. 

The experiment was described to the ob- 
server as an attempt to determine whether 
different kinds of stimulation (buzzer and 
light diming) produced different types of 
skin resistance changes. S-M observers were 
told that the performer would be shocked 
every time the light dimmed in order to de- 
termine the effect of shock on the perform- 
er's GSR. The observer was reassured that 
he was a control subject and would not be 
shocked at any time. NS-M observers were 
told that the performer would try to jerk his 
hand off the apparatus in front of him every 
time the light dimmed to see if the movement 
affected his GSR. The observer was told 
that he was a control subject and that no one 
would be shocked. 

Although the experimenter appeared to be 
taking readings from both the performer and 
observer, only the observer's GSRs were 
recorded. After an initial adaptation period, 
in which the buzzer was presented for 1,0 
second followed immediately by dimming of 
the light for 1.0 second, the performer’s hand 
was strapped to the inductorium in front of 
him. It was only during this latter phase of 
the experiment that the performer made arm 
movements. 

There were 13 trials during this phase of 
the experiment. Ten trials were used for 
conditioning, with. 3 test trials interspersed 
in the same position for all subjects. The 

intertrial interval was 15 seconds. ; 

The method of moving averages with 
blocks of three trials was used for all curves 
presented in this paper, using the number 
of observers giving a GSR on given trials as 
the dependent response measure. For pur- 
oses of statistical analysis, the total pe 
of GSRs for each observer, for a given s 


of trials, was used as the dependent rc- 
sponse measure in all experiments. 

Three criteria were used to identify a GSR 
response : 

1. The movement of the recording pen had 
to be in the direction of lowered resistance. 

2. The movement had to be at least as 
large as twice the thickness of the recording 
pen line. 

3. The movement had to occur within 4 
seconds following the termination of the 
buzzer. 


The records were scored independently by 
the experimenter and the confederate, al- 
though both had some knowledge as to 
Which conditions given observers had been 
assigned. The scorers agreed on 94% of the 
trials concerning whether or not a GSR had 
Occurred; disagreements were resolved by 
re-examination and discussion by the scorers. 


The criterion for adaptation during 
the first phase of the experiment was 
two successive no response trials; all 
Observers were given a minimum of 
four adaptation trials, however. The 
adaptation curves over the four trials 
are presented in Figure 1, While it is 
apparent from Figure 1 that subjects 
in the S-M condition were less respon- 
sive than subjects in the NS-M condi- 
tion, a comparison of the total number 
of individual responses over the four 
adaptation trials for each condition by 
the Mann-Whitney U test revealed no 
reliable differences (U —81.5).* 

The relative effectiveness of the two 
conditions to instigate observer GSRs 
may be examined in the "conditioning" 
Section of Figure 1. The total scores 
for individual observers in the two con- 
ditions (on trials in which the con- 
federate responded) were compared by 
the Mann-Whitney U test, S-M ob- 
Servers appear to be more responsive 


* Two-tailed tests are used throughout this 
Paper. The Mann-Whitney test was em- 
ployed in Experiment I because the compari- 
Son groups differed significantly in variance 
uncorrectable by usual transformation tech- 
niques) and, at the same time, appeared to 
have differently shaped distributions, The 
data for Experiments II and III were more 
amenable to analysis of variance according 
to the criteria suggested by Boneau (1960). 
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Fic. 1. Percentage of observers giving GSRs during adaptation, conditioning, 
and test trials (Experiment I). 


than NS-M observers, but the differ- 
ence is not reliable (U = 70.5, p < .10). 

The total number of GSRs over the 
three test trials was obtained for each 
observer and the Mann-Whitney U test 
was used to compare the differences 
between the conditions. The “test 
trial” section of Figure 1 shows that 
S-M observers gave more responses 
on test trials than NS-M observers. 
The total score differences were reli- 
able at less than the .05 level (U — 
61.5). 

A post-experiment questionnaire re- 
vealed that most observers gave de- 
scriptions of the performer's situation 
which were in agreement with the ex- 
perimental instructions. Two NS-M 
observers reported that the performer 
might be receiving a shock. One ob- 
server in each condition reported some 
expectation of receiving shock. 

These results lend support to the 
notion that conditioning can occur 
through vicarious instigation. While 
the arousal effects of the arm movement 
are apparent from the data, there is 
some tendency for GSR instigation to 


occur more frequently when the ob- 
server is told that the performer 1s 
being shocked. Furthermore, the fre- 
quency of conditioned responses (CRs) 
is reliably greater in the S-M condition. 
Conditioning was not solely a function 
of the performer's response, but was de- 
pendent upon the instructions concern- 
ing shock. 


EXPERIMENT IT 


The encouraging results of Experi- 
ment I testified to the utility of the 
experimental setting in studying vicari- 
ous instigation. Yet, Experiment I 
contained only two pseudovicarious in- 
stigation controls; i.e., the NS-M con- 
dition, for performer response, and the 
instructions to the observer that he 
would not be shocked. A third control 
is necessary to determine whether the 
performer's UCS per se, will account 
for the differences obtained. It is con- 
ceivable that S-M observer’s were af- 
fected solely by the instructions con- 
cerning shock, and that the performer's 
response was not a contributing factor; 
or, that the apparent presence of elec- 
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tricity in the inductorium had arousal 
effects which were independent of the 
performer being shocked. Since the 
differences between the movement con- 
ditions for instigations was not reliable 
in Experiment 1, a complete 2 x 2 fac- 
torial design involving shock and move- 
ment was employed in this experiment. 
This design provided for the various 
pseudovicarious instigation controls. 
The four conditions of this experiment 
are identified as follows: 

1. S-M—Shock to performer, and 
performer arm movement. 

2. S-NM—Shock to performer, but 
no performer arm movement. 

3. NS-M—No shock to performer, 
but performer arm movement. 

4. NS-NM—No shock to performer, 
and no performer arm movement, 

The S-M and NS-M conditions were 
the same conditions used in Experi- 
ment L The S-NM condition is a 
"UCS control" for the instructions con- 
cerning shock and an electrified induc- 
torium; the performer made no re- 
sponse. The NS-NM condition was 
another UCS control; the performer’s 
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hand was not placed on the inductorium 
during the conditioning phase of the 
experiment. She was told that she 
would not be Shocked, and should not 
touch the inductorium when the light 
dimmed or else she would be shocked. 
This condition was used to assess the 
arousal effect of an electrified induc- 
torium without shock to the performer. 

Twenty-four subjects were assigned to 
each condition so that each condition con- 
tained an equal number of males and females. 

A female confederate acted as the per- 
former in this experiment. In addition to the 
different instructions for the various con- 
ditions, there Were some procedural changes. 
In Xperiment I, an attempt was made to use 
a "blind" Performer by having the critical 
Instructions for the different conditions read 
to the observer before the performer entered 
the laboratory, This procedure was aban- 
doned in this experiment for two reasons: the 
performer in Experiment I Írequently could 
identify the condition being used by the kinds 
of Overt responses the Observer made (e.g. 
smiling, looking away when the performer 
jerked, apparent boredom, etc.), and the per- 
former in this experiment had to know what 
condition she was in so that she could behave 
appropriately, 

The criteria for Scoring GSRs were the 
same as in Experiment I, except that the pen 
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movement in the direction of lowered re- 
sistance need only have been greater than the 
thickness of the pen line (rather than twice 
the thickness). This change represented an 
attempt to increase the sensitivity of the 
GSR measure, particularly with regard to 
CRs, which generally occurred relatively in- 
frequently in Experiment I. Scoring was 
done independently, and without identification 
of the condition to which individual observers 
were assigned. The scorers agreed on 96% 
of the trials concerning the occurrence of 
GSRs; disagreements were resolved by re- 
examination and discussion by the scorers. 


No differences in observer GSRs 
were observed between conditions dur- 
ing the adaptation period, as may be 
seen in Figure 2. 

Table 2 summarizes the analysis of 
variance performed on the total scores 
for observers in the different conditions 
over the 10 conditioning trials. 

Since the purpose of this study was 
to determine whether the differential 
effect of shock instructions found in 
Experiment I would occur without per- 
former arm movement, the Shock x 
Movement interaction in the analysis 
of variance is particularly relevant. 
This interaction indicates that the dif- 
ferential effect of shock instructions de- 
pends upon whether or not performer 
arm movements are present. There is 
a significant difference between the fre- 
quency of GSRs given by S-M observ- 
ers and the frequency of GSRs given 
by observers in the NS-M condition 
(^ = 4.33, df = 46, p < .01); this dif- 
ference is not significant between the 
observers in the S-NM and NS-NM 
conditions. 

The main effect for Movement re- 
veals that performer arm movements 
are effective elicitors of the GSR, re- 
gardless of shock instructions. The ¢ 
between S-M and S-NM observers is 
5.89, df = 46, p < .01; the t between 
the NS-M and NS-NM observers is 
2.92, df = 46, p < 0l. 

The main effect for Shock is prima- 
rily due to the difference between the 
S-M and NS-M observers, the differ- 


TABLE 2 


SUMMARY OF ANALYSIS OF VARIANCE OF 
GSR INSTIGATION SCORES 


Source df MS F 


Shock instructions 
Performer movement 
Sex of observer 


94.01 | 16.79** 
219.01 | 39.11** 
460| — 


Shock X Movement 6.92* 
Shock X Sex $01| — 
Movement X Sex 7.58 | 1.36 
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Shock X Movement 29] = 
X Sex 
Residual 88 5.60 


*p «.025. 
** p» «.001. 


ence between the S-NM and NS-NM 
observers being small and not signifi- 
cant. 

The ż tests reported above show that 
observers in the S-M condition differ 
from observers in the NS-M and 
S-NM conditions. The S-M observers 
also differ from NS-NM observers 
(t — 7.35, df = 46, p < .01). 

The differences between conditions 
over conditioning trials may be seen 
in Figure 2. 

The results of Experiment I sug- 
gested instigation was not solely a func- 
tion of the performer's arm movement. 
This finding is strongly supported by 
the results of this experiment; a very 
reliable difference was found between 
the S-M and the NS-M conditions. 
In addition, this study reveals that the 
instructions concerning shock to the 
performer (S-NM condition), or that 
electricity is being passed to the induc- 
torium (NS-NM condition) are not 
sufficient in themselves to account for 
the frequency of observer GSRs in the 
S-M condition. It is apparent from 
these data that frequency of instigation 
depends upon the instructions concern- 
ing shock and the occurrence of the 
performer's response. 

Figure 2 also presents the test trial 
data for observers in four conditions. 
An analysis of variance of the test trials 
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scores revealed no reliable differences 
in conditioning. The order of condi- 
tions, based upon the magnitude of the 
means for CRs, is in line with the 
order of conditions based upon the 
means for instigations. The frequency 
of CRs tends to be greatest in the 
S-M condition, although the difference 
between S-M observers and observers 
in the other conditions was not great 
enough to yield statistically significant 
results. 

Although reliable differences were 
found between the conditioning results 
from the S-M condition and the NS-M 
condition in Experiment I, the differ- 
ences between responsiveness on test 
trials for these conditions in this study 
were not reliable, but in the same 
direction. 

A content analysis of the post-experi- 
ment questionnaire data attempted to 
answer two questions : Did the observer 
describe the performer's actions in ac- 
cordance with the instructions he re- 
ceived? Upon questioning, did the 
observer report that he expected a 
shock during the last part of the ex- 
periment? 

In the S-M condition, one observer 
reported that he did not think the per- 
former was being shocked, while this 
report was given by eight observers in 
the S-NM condition. Judging from 
their reports, a large number of obsery- 
ers in the S-NM condition believed 
that the performer was receiving a very 
mild shock or no shock at all; this 
may account for the relatively low fre- 
quency of instigations in this condition, 
Many S-NM observers did not per- 
ceive that the performer was being 
shocked or made uncomfortable. Five 
observers in the NS-NM condition re- 

ported that they did not think the fe 
former would be shocked if she touched 
the inductorium. The eed of E 
servers, however, described the condi- 


tions of the experiment in accordance 
io 
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with the instructions given, the one 
major exception being the S-NM ob- 
servers. 

Most of the observers who reported 
that they expected to receive a shock 
during the last part of the experiment, 
apparently expected the shock on the 
first trial or two, and when it did not 
happen, they no longer expected to be 
shocked. 

When the mean GSRs from observ- 
ers who expected shock are compared 
with the general means for the relevant 
conditions, it appears that all means are 
essentially equal to or less than the 
general mean for the particular con- 
dition—except for the instigation mean 
of S-M observers. Of the two observ- 
ers in the S-M condition who expected 
shock, one gave GSRs on 8 out of 10 
conditioning trials, which is equal to 
the median number of responses given 
by observers in the S-M condition ; 
the other observer gave GSRs on all 
10 trials, which is equal to the score 
of one-third of the observers in the 
S-M condition. Of the 24 observers 
in each condition, 2 S-M observers, 3 
S-NM observers, 5 NS-M observers, 
and 5 NS-NM observers reported some 
expectation of receiving shock. Conse- 
quently, even though a small number of 
Observers in each condition reported 
that they expected to be shocked, there 
is nothing in their data to suggest that 
this is a significant factor in their GSR 
responsiveness, 

While there is general agreement be- 
tween these findings and the results of 
Experiment I, the differences between 
the S-M and NS-M conditions with re- 
Spect to instigation and conditioning 
Were not consistently as large in both 
Studies. When the probabilities of the 
Observed differences are considered to- 
gether, there is good support for the 
phenomenon of conditioning through 
vicarious instigation. Yet, it appears 
that some refinements in measuring 
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techniques might improve the reliabili- 
ties of the differences. 


EXPERIMENT III 


There is some possibility that the in- 
structions and performer reactions re- 
sensitized the observer's response to the 
buzzer during the second part of the 
experiment, so that the measures of in- 
stigation and conditioning were con- 
founded with a resensitized response to 
the buzzer, per se. Furthermore, the 
use of only three test trials may have 
been insensitive to some of the condi- 
tioning that occurred. For these rea- 
sons, the measurement technique was 
changed to conform to the recommen- 
dations of Stewart, Stern, Winokur, 
and Fredman (1961), in which the 
latency of the GSR was used to identify 
response measures. By taking into ac- 
count the latency of the GSR to a 
stimulus, these authors found it possi- 
ble to obtain separate measures of re- 
sponses to the CS and UCS, and to 
identify conditioned anticipatory re- 
sponses, for each trial. With this tech- 
nique, therefore, it is possible to have 
each presentation of the stimuli serve 
as a test trial for conditioning, and 
simultaneously obtain measures of re- 
sponses to the CS and to the UCS. 
The purpose of this experiment was to 
determine the utility of the latency 
technique for conditioning through vi- 
carious instigation. Only the S-M 
and NS-M conditions were examined 
in this experiment, using the same in- 
structional procedures as reported for 
Experiment ite 


Some procedural changes with regard to 
presentation of stimuli were introduced to 
accommodate the latency technique. During 
adaptation, only the buzzer was presented for 
6.5 seconds. The intertrial interval was 
varied at random (during adaptation and 
conditioning) from 20 to 60 seconds. During 
conditioning, the buzzer was presented for 
the same length of time as im adaptation. 
The dimming of the light for 1 second fol- 
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lowed the termination of the buzzer by .5 
second. As usual, the performer jerked her 
arm when the light dimmed. 

For scoring purposes, the stimulus interval 
was divided into three 4-second units. The 
first 4-second unit represented the response 
to the buzzer; the last 4-second unit, meas- 
ured from the end of the light dimming 
period, represented the response to the per- 
former’s movement; the second 4-second unit 
represented a conditioned anticipatory re- 
sponse. 

The criteria for identifying GSRs was the 
same as used in Experiment II, except that 
the pen movement in the direction of low- 
ered resistance had to occur within one of 
the 4-second intervals, and the movement had 
to indicate an accelerating change rather than 
a decelerating one; this latter criterion 
seemed necessary to distinguish GSRs initi- 
ated in one scoring period from one initiated 
in the next period, on a given trial. Again, 
scoring of the records was done independ- 
ently, without identification as to which con- 
dition the observer had been assigned. The 
scorers agreed on 9396 of the cases, and 
resolved their disagreements by discussion. 

The performer was a female in this ex- 
periment, but not the same person who was 
used in Experiment II. 

Sixteen subjects were assigned to each con- 
dition so that each condition contained an 
equal number of males and females. 


The adaptation for the two condi- 
tions was quite similar. Figure 3 
shows the response frequencies to the 
buzzer, the second, and the third scor- 
ing units for both conditions over the 
first four adaptation trials. Since only 
the buzzer was presented during adap- 
tation, responses occurring during the 
second and third scoring periods are 
assumed to be spontaneous. 

When responses to the buzzer dur- 
ing conditioning are compared to the 
buzzer responses occurring during 
adaptation (see Figure 3), it is ap- 
parent that observers are resensitized 
in the conditioning period. An analysis 
of variance based upon total responses 


8 The 4-second unit was used in Experi- 
ments I and II for identifying GSRs to the 
stimuli. It is also in agreement with the 
general findings of Stewart, Stern, Winokur, 
and Fredman (1961, p. 63). 
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to the buzzer during conditioning re- 
vealed no reliable differences between 
the two conditions. 

The curves presented in Figure 3 for 
instigated responses have the same gen- 
eral shape as the curves for the com- 
parable conditions in the two previous 
studies; S-M observers are more re- 
sponsive than NS-M observers. The 
analysis of variance based upon total 
instigations for each observer shows 
that this difference is reliable (see 
Table 3). , 

S-M observers gave more condi- 
tioned anticipatory responses than 
NS-M observers, as may be seen in 
Figure 3. The analysis of renun 
summarized in Table 4, shows that the 

difference in conditioned anticipatory 
response totals for observers in the two 
conditions is reliable. vw 
In this experiment, the OE 
asked each observer at the end of the 


Percentage of observers giving GSRs durin, 
conditioning (Experiment III). 


g adaptation and 


experiment if he expected to receive a 
shock. Only one observer in each con- 
dition reported that he expected shock. 

These findings provide additional 
Support for the phenomenon of condi- 
tioning through vicarious instigation. 
The general level of observer instiga- 
tion was similar to that found for com- 
parable conditions in Experiment II, 
but a general increase in the number of 
conditioned GSRs was noted in Ex- 
periment IIT, where anticipatory re- 


TABLE 3 


SuMMARY or ANALYSIS OF VARIANCE 
FOR INSTIGATIONS 


Source df MS F 
Shock instructions 1 | 40.50 | 5.36* 
Sex of observer 1 8.00 | 1.06 
Shock X Sex 1 0.00 
Residual 28 7.77 
*p <.05, 
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TABLE 4 
SUMMARY OF ANALYSIS OF VARIANCE FOR 
CONDITIONED ANTICIPATORY RESPONSES 


Source df MS P 
Shock instructions 1| 26.28 | 9.16* 
Sex of observer 1 2.53 — 
Shock X Sex 1 1.53 — 
Residual 28 2.87 
*p «0l. 


sponses were examined on all 10 trials 
by the latency criterion. 


DISCUSSION 


Considering all three studies to- 
gether, there is a good deal of consist- 
tency in the findings with regard to the 
S-M and NS-M conditions. Although 
the differences between these conditions 
for instigations and conditioned re- 
sponses was not consistently reliable in 
Experiments I and II, the trends in the 
differences were always in the same di- 
rection. With particular reference to 
the CR data, the finding in Experiment 
III that observers in both conditions 
are resensitized to the buzzer suggests 
that this factor may have tended to mit- 
igate the reliability of the differences. 

The reader may be concerned with 
the possibility that the differences be- 
tween conditions may be influenced by 
different base levels of resistance, pro- 
duced by the different treatments. The 
base levels of resistance (in ohms) 
were generally lower for S-M observers 
than for observers in the control con- 
ditions in these studies, although fre- 
quency of GSRs was greatest in the 
S-M condition. Since there is a posi- 
tive relationship between GSR size and 
base level of resistance (Haggard, 
1945), this would suggest that the ef- 
fect of base level differences tended to 
decrease the differences in frequency 
of GSRs between the S-M and control 
conditions, according to the GSR cri- 
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teria used. Consequently, if base level 
differences did influence the results of 
these studies, it was in the direction of 
minimizing vicarious instigation effects. 

It should be noted that the present 
experimental situation underplays the 
instigation of the performer. This was 
done in order to keep the S-M and 
NS-M conditions plausible and yet 
comparable. Presumably, the level of 
instigation (and consequently, condi- 
tioning) would be higher if the per- 
former's behavior was changed so as 
to provide greater evidence of pain. 
Actually, the present level of instiga- 
tion and conditioning is high enough to 
provide evidence for the phenomenon, 
and yet low enough to provide a basis 
for testing the facilitating effects of a 
variety of variables. 

In all of the data, observer GSRs 
tend to occur less frequently during 
the last half of the conditioning period 
than the first half. The explanation 
for this change is not entirely clear. 
One possible explanation is that GSR 
adaptation occurs because the observer 
becomes habituated to the specific situ- 
ation as a result of repeated presenta- 
tions of the stimuli, as Sears (1933) 
has shown. In addition, relatively 
short intertrial intervals tend to pro- 
duce rapid GSR adaptation to repeated 
stimuli (Coombs, 1938). It also is 
possible that observers pay less atten- 
tion to the stimuli as the experiment 
progresses. For example, some ob- 
servers in the S-M condition reported 
that they could not bear to watch the 
performer being shocked and looked 
away after the first few trials; observ- 
ers in the NS-M condition may have 
become bored with, and less attentive 
to, the repetitious movement. 

It might also be argued that vicarious 
instigation phenomena have a tran- 
sitory quality, the effect diminishing 
with successive exposures to the per- 
former's situation as the observer be- 
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gins to exercise some internal controls 
over his ERs. An adult audience wit- 
nessing a movie, for example, may be 
emotionally aroused as they watch their 
favorite star fall off a cliff for the first 
time; but their level of arousal will 
probably be lowered after witnessing 
the same scene over and over again. 
Individuals can exercise control over 
their emotions when they can anticipate 
the occurrence of an emotionally arous- 
ing situation; although, considering 
the number of times a child can sit 
through the same movie with appar- + 
ently undiminished enthusiasm, it 
would seem that children are less capa- 
ble than adults in exercising such 
control. 

Since it cannot be determined, from 
the data reported, whether the adapta- 
tion effects noted in these studies are 
a function of the characteristics of the 
GSR or a function of the characteristics 
of vicarious instigation phenomena gen- 
erally, it would seem that a different 
measure of vicarious instigation is 
needed in order to avoid this confound- 
ing. 

At the same time, some measure is 
needed to identify the nature of the 
observer’s ER, While the reader may 
be tempted to think of these experi- 
ments as reflecting some sort of ob- 
server empathy, there is no basis for 
this interpretation. Some observers 
may be empathizing and some obsery- 
ers may be enjoying the performer’s 
apparent pain. Research is presently 
underway which employs the evaluative 
scales of the Semantic Differential for 
this purpose. It may be possible to 
measure conditioning (as. Staats & 
Staats, 1957, have done), with the per- 

former's UER as the UCS for the ob- 
server. If this research is Successful, 
it not only should provide a basis for 
determining how attitudes and values 
ditioned through vicarious 
e qoe eo 
instigation, but it should be po 


to examine, independently of the GSR, 
whether adaptation of vicariously in- 
stigated responses occurs. 

The general experimental situation 
used in these studies appears to be a 
useful one for measuring conditioning 
through vicarious instigation. — Tt 
should be noted that the overall find- 
ings lend Support to the concept of vi- 
carious instigation as defined in the 
introduction to this paper: the instiga- 
tion and conditioning of the observer's 
GSR cannot be attributed to the per- 
former's movement, per se, or to the 
electrified inductorium, per se, or to 
the instructions concerning shock, per 
se. It is the apparent shock of the per- 
former (and by inference, the perform- 
er’s UER) which is the effective insti- 
gating stimulus for observer response. 


Vicarious Instigation and Conditioned 
Emotional Response 


Vicarious instigation is certainly not 
a new problem in social psychology. 
The phenomenon has been identified 
many times by other terminology. Mc- 
Dougall (1908) referred to the phe- 
nomenon as the “sympathetic induction 
of emotions.” Floyd H, Allport (1924) 
reacted strongly against this instinct 
doctrine and sought to bring the phe- 
nomenon within the scope of condi- 
tioning theory. 


A theory far more plausible than that of 
instinctive induction of emotion . . . is the 
principle of conditioned. emotional response. 
It may be illustrated by the panic which 
Seizes all the persons in a throng when a few 
of them show signs of terror. Granting that 
the true cause for alarm has been seen by 
only the original few, we have here a case 
of fear aroused by the process of sympathy. 

he explanation, according to the present 
theory, is as follows: we have been pre- 
viously terrified in company with others and 
So have our fear emotion transferred to 
characteristic attendant stimuli, such as cries 
and visible expressions of the emotion in 
those about us. We now react at once to 
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the sight of fear in others by a fear response 
of our own. Here the conception of sympa- 
thetic induction loses its force. We fear not 
merely because we see the expression of fear 
in others; but because we have learned to 
read these expressions as signs that there 
rcally is something to be afraid of. It is not 
fear induced from others that we experience, 
but our own fear of dangerous situations 
which has been conditioned by social stimuli 
(p. 235). 


Allport’s (1924) analysis argues 
against vicarious instigation, as defined 
in this paper; i.e., the expressed emo- 


tions of others are simply discrimina- | 


tive stimuli for subsequent presentation 
of the UCS to the observer. His analy- 
sis may hold for some situations, but it 
does not seem applicable to the experi- 
ments just reported. Observers were 
reassured that they would not be 
shocked, and therefore had nothing to 
fear for themselves; and most observ- 
ers reported that they were not con- 
cerned for themselves. Furthermore, 
the results of the experiments show that 
the arousal effects are greatest for 
S-M observers, when the performer’s 
situation is made to appear most un- 
pleasant; but the number of S-M ob- 
servers who apparently expected to be 
shocked is equal to or less than the 
number of observers who expected 
shock in the less arousing conditions. 
While these arguments are not con- 
clusive, they suggest that vicarious in- 
stigation is either a special case of con- 
ditioned emotional responding or that 
vicarious instigation depends upon a 
different learning principle. The reso- 
lution of this issue is not imminent, 
since further clarification of the nature 
of conditioned emotional responses and 
vicarious instigation is required before 
suitable arguments can be formulated. 


Vicarious Instigation in Relation to 
Experimental Operations 


The use of verbal instructions to ma- 
nipulate the independent variables, and 


a post-experiment questionnaire to 
check the subject’s acceptance of the 
instructions, limits the application of 
the designs used for experimental 
analysis of vicarious instigation to cer- 
tain classes of human subjects. It is 
clear that these experimental operations 
are not appropriate for work with 
lower animals, young children, or with 
mental retardates who have little verbal 
facility. 

One possible solution to this problem 
is to substitute specific training his- 
tories for the instructions. Observers 
first may be given direct experience 
with the condition that the performer 
will undergo, and then be exposed to a 
performer in that condition. Subse- 
quently, the observer may be tested to 
determine if his behavior was affected 
by exposure to the performer. Al- 
though it seems feasible to provide dif- 
ferent training histories for each ex- 
perimental and control condition used 
in studies of vicarious instigation, ex- 
perimental evidence is needed to dem- 
onstrate the suitability of the training 
technique for investigation of vicarious 
instigation phenomena. 

To some extent, the present defini- 
tion of vicarious instigation has a cer- 
tain conceptual inadequacy. In order 
to identify conditioning through vicari- 
ous instigation, it is necessary to show 
that sources of pseudovicarious insti- 
gation do not account for the behavior 
in question. From an experimental 
viewpoint, the presence of a particular 
factor is more easily demonstrated than 
its absence. Until a more positive 
statement can be formulated concern- 
ing the necessary and sufficient condi- 
tions for vicarious instigation, it may 
be anticipated that the present defini- 
tions will tax the experimenter's in- 
genuity for setting up experimental 
situations which eliminate sources of 
the pseudoeffects mentioned previously. 
In spite of this inadequacy, the con- 
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ceptual framework has experimental 


value. 
CONCLUSION 


This paper offers a conceptualization 
of conditioning through vicarious in- 
stigation and provides some experi- 
mental support for the phenomenon. 
The importance of the. problem is 
made evident by the paucity of experi- 
mental evidence concerning the role of 
the phenomena of empathy, envy, and 
sadism in the acquisition of interper- 
sonal behavior. The analysis of pseudo- 
vicarious instigation dramatizes the 
inadequacy of anecdotal evidence in 
this area. 

No systematic attention has been 
given here to the personality and social 
determinants of vicarious instigation, 
or to the consequences of vicariously 
instigated conditioning for social inter- 
action. Heider (1958), for example, 
identifies a wide range of common sense 
principles which could be incorporated 
into the conditioning models presented 
in this paper. One of the virtues of 
these models is that they permit the 
perceptually oriented theorist to apply 
his principles in a conditioning frame- 
work and allow the S-R learning the- 
orist to test his principles with a wider 
range of stimuli. 
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CONDITIONAL-OUTCOME CHOICE BEHAVIOR IN RATS' 


FRANK A. LOGAN 


Yale University 


A conditional-outcome choice has the 
property that, for at least one of the 
alternatives, the outcome itself depends 
upon the subject's behavior subsequent 
to the choice. Because this type of 
situation is somewhat novel, it may be 
useful to begin with an example. 

Figure 1 shows the floorplan of a 
maze in which a rat can run from the 
start (S) around one of two perimeter 
paths to the goal (G). If he turns to 
the right after leaving the start box, his 
path takes two additional right turns 
and he then arrives at a second choice 
point, shaped somewhat like a Y maze 
and exposing two swinging doors. One 
of these doors is white and the other 
black, and reward at the goal is given 
if the rat selects (say) the white door 
regardless of its position. That is to 
say, the second choice point contains 
à conventional visual stimulus discrimi- 
nation problem. Alternatively, if the 
rat turns to the left after leaving the 
start, his path takes two additional left 
turns and, at the location corresponding 
to where in the other wing he encoun- 
ters a discrimination problem, here one 
of the two guillotine doors is closed 
forcing him, half the time to the left, 
half the time to the right, through a 
grey swinging door into the goal. Typi- 
cally, he is fed in this case with the fre- 
quency that he is earning reward by his 
discrimination performance in the other 
wing. 

The important thing to recognize is 
that, for the first alternative described, 
the outcome cannot be specified in ad- 
vance. Whether or not the subject gets 
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reward depends upon what he does, in 
this case, upon whether he goes through 
the correct door. It is this feature that 
the subject has control over the out- 
come that characterizes conditional- 
outcome choices. 

This type of situation has consider- 
able interest in its own right. As a 
single but familiar example, a person 
selecting a profession cannot just con- 
sider various jobs with associated prob- 
abilities of payoffs because the payoff 
depends upon how he performs in his 
chosen field. How much one earns as 
a scientist, a doctor, or even a ditch- 
digger depends in part upon how good 
he is at that job. The outcome, in 
short, is conditional upon performance. 

But beyond this intrinsic interest, 
conditional-outcome choices have an 
important relationship to the problem 
of response definition. The discussion 
of this relationship in the next section 
will also show the logic of the re- 
searches to be reported in subsequent 
sections. 


Conditional-Outcome Choices and the 
Problem of Response Definition 


Fundamentally, the problem of re- 
sponse definition is one of deciding 
upon a rule, or set of rules, stating the 
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Fic. 1. Floorplan of the block-eight maze. 
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conditions under which behaviors that 
differ in some way may nevertheless be 
aggregated into a single response class. 
All familiar response terms are aggre- 
gates of one kind or another. For ex- 
ample, there are a great many ways 
that a rat can get from one end of an 
alley to the other but these are all 
lumped together indiscriminately in the 
expression, the rat “ran down the 
alley.” 

The conventional solution of this 
problem seems to be that responses can 
be aggregated so long as they are not 
differentially reinforced (see, e.g., 
Guthrie, 1940; Hull, 1943). Thus, no 
distinctions are made among the vari- 
ous ways of running down an alley be- 
cause the reward that is given at the 
end of the alley typically does not de- 
pend upon how the rat got there. This 
approach is schematized for a bar- 
pressing situation in Figure 2. 

In the upper left box are identified 
four responses that can occur in a two- 
bar Skinner box: the rat can press Bar 
A with the left or right paw, or can 
press Bar B with the left or right paw. 
Of course, each of these is already an 
aggregate of different postures, differ- 
ent forces, and so forth, but they will 
suffice for the purposes of discussion. 
Now the typical procedure is to rein- 
force the rat differentially with respect 
to which bar he presses but not to re- 


BAR A, LEFT PAW —8 PELLETS 
BAR A, RIGHT PAW~~~8 PELLETS 5 BAR A — 8 PELLETS 
BAR B, LEFT PAW ~+2 PELLETS 

BAR B, RIGHT PAW~——=2 PELLETS 


BAR B——-2 PELLETS 


BAR A, LEFT PAW —-8 PELLETS 
BAR A, RIGHT PAW — 2 PELLETS BAR A—-842 PELLETS 
BAR B, LEFT PAW—8 € 2 PELLETS = BAR B—-842 PELLETS 
BAR B, RIGHT PAW— 8 t 2 PELLETS 


Fic. 2. Schematization of the responses in 
o-bar Skinner box for an unconditional- 
ps e choice between the bars (top boxes) 
out conditional-outcome choice between the 
an 


bars (lower boxes). 


inforce differentially which paw he 
uses. Such a condition is indicated by 
the wavy arrows showing that the rat 
gets 8 pellets for each of the Bar A re- 
sponses and 2 pellets for each of the 
Bar B responses. According to the 
rule that responses that are not differ- 
entially reinforced can be aggregated, 
the description can be simplified to that 
shown in the upper right box. In this 
case, information about which paw the 
rat used is completely suppressed and 
one simply refers to Bar A responses 
and Bar B responses. 

The justification for this procedure is 
the same one that applies to the forma- 
tion of any meaningful scientific con- 
cept, namely, that responses defined in 
this way enter into reasonably simple 
and consistent lawful relationships with 
other concepts (see, e.g, Spence, 
1948). The substantial literature on 
choice behavior provides ample evi- 
dence of the usefulness of this pro- 
cedure. 

Suppose, however, that the reinforce- 
ment conditions were arranged as 
shown in the lower left box. In this 
case, the rat is differentially reinforced 
with respect to which paw he uses on 
Bar A, getting 8 pellets for left-paw re- 
sponses and 2 pellets for right-paw re- 
sponses. On Bar B, however, the re- 
ward is sometimes 8 and sometimes 2 
pellets but this variation is not condi- 
tional upon which paw the rat uses. 

Specifically, the question being 
raised is this: can the two Bar A re- 
Sponses be aggregated even though 
they are differentially reinforced? Can 
We suppress information about which 
paw the rat uses, and even ignore the 
fact that the reward is conditional upon 
which paw he uses? In short, can we. 
for purposes of predicting choice, re- 
fer to Bar A responses as shown in the 
lower right box? If so, then that re- 
Sponse receives varied reinforcement, 
Sometimes 8 and sometimes 2 pellets. 
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Again, the question of whether this 
step can be taken means, will a re- 
sponse defined in this way act like fa- 
miliar responses that only aggregate 
nondifferentially reinforced behaviors? 
Will such a response obey the same 
laws and principles as determined un- 
der conventional conditions? One way 
of asking this question is to arrange the 
reward on Bar B so that the variation 
between 8 and 2 pellets there exactly 
matches that being received on Bar A 
as a result of trial-to-trial oscillation 
in the rat's behavior. If the aggrega- 
tion is legitimate, then performance 
under these conditions can be predicted 
from familiar principles because the 
reward is not differential with respect 
to which bar the rat presses and he 
should, therefore, be indifferent be- 
tween them when given a choice. 

However, if the aggregation is not 
appropriate, if the familiar principles 
do not apply, or if new principles 
are required by conditional-outcome 
choices, then a preference might appear 
between alternatives that are other- 
wise equally reinforced. Specifically, 
for example, there might be something 
positive to a rat about having the re- 
ward under his control, in which case 
he would prefer an alternative such as 
Bar A where what he gets depends 
upon what he does. Or, with equal a 
priori possibility, such a condition 
could be aversive, in which case the 
rat would prefer not to have to make a 
differentiation on the way to the re- 
ward and so would choose an alterna- 
tive such as Bar B. The following re- 
searches have been aimed at evaluating 
these possibilities. 


METHOD 


Subjects. Naive male rats were obtained 
from a variety of commercial sources at 
90-120 days of age. They were housed in 
individual cages and were fed 12 grams of 
Purina laboratory chow daily 5-10 minutes 


after being run. Reward consisted of four 
45-milligram Noyes pellets. 

Apparatus. The studies of choice behavior 
when the outcome was conditional upon run- 
ning speed were run in the double-alley maze 
that has been described in detail elsewhere 
(Logan, 1960). Briefly, it consists of a grey 
start box leading to two parallel alleys, one 
of which is white and the other black, both 
4-feet long and equipped so that food reward 
can be delivered automatically according to 
a wide range of possible conditions. 

The studies of choice behavior when the 
outcome was conditional upon a visual dis- 
crimination or a position response were run 
in the block-eight maze shown previously as 
Figure l. This maze is grey throughout and 
is illuminated by six 7-watt bulbs spaced 
symmetrically around the maze and 18 inches 
above it. Food reward is given manually 
when appropriate by pulling a rod outside the 
maze that uncovers the food cup. 

Procedure. Trials were run in blocks of 
four or six, one trial in each block being a 
free choice trial and the remaining trials 
being forced so as to insure that the rat 
went equally often in each direction. The 
sequence of forcing was arranged so as to 
avoid learnable sequences and to counterbal- 
ance alternation tendencies. The number of 
such blocks of trials run in a day varied from 
one to six in the various studies, and the 
intertrial interval was in some cases only the 
few seconds necessary to move the rat from 
the goal to the start box and in other cases 
was the time required to run the other con- 
current rats. On every occasion that the 
same problem has been studied using pro- 
cedures differing in any of these ways, the 
results have been closely comparable and 
there is no reason to believe that the results 
are peculiar to a particular aspect of the 
procedure. 

One detail of the procedure that may be 
especially important concerns the technique 
for matching frequency of reward in the two 
alternatives. Half of the trials in each block 
were to each alternative and the rewards 
given in the unconditional alternative in any 
block of trials duplicated the rewards that 
the subject had earned in the conditional al- 
ternative on the preceding block of trials. 
Hence, there was a slight unavoidable lag in 
matching the rewards. 


Outcome Conditional upon Running 


Speed 


One way that the reward can be 
made conditional upon the rat's be- 
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ward conditional upon running speed, 
the effect is not very large. 
Correlated reinforcement versus 
matched frequency and interval of re- 
ward. Our final attack upon this prob- 
lem attempted to equate both the fre- 
quency and the interval of reinforce- 
ment in the two alleys. To do this, the 
problem in the correlated alley was 
changed somewhat, the rats in this case 
being required to run down to the en- 
trance to the goal section of the alley 
and to wait there for at least 2 seconds 
before proceeding to the goal. If they 
failed to spend at least 2 seconds in that 
foot of the alley immediately preceding 
the goal, they were not rewarded. 
This restriction on the correct response 
was made so that the guillotine door in 
the alternative alley could be closed to 
detain the rats at the same distance 
from the goal and for a length of time 
matched to the time they detained 
themselves in the correlated alley, 
Since the frequency of reward was also 
matched, there is no apparent bias in 
the reward conditions favoring one al- 
ley over the other and there should 
therefore be no Preference between 
them. 
Four rats have been run under these 
conditions and, during the first 1,024 
trials, all four developed and main- 
tained 10096 preference for the corre- 
lated alley, for the alley in Which they 
timed their delay at the entrance to the 
goal section of the alley, However, 
when the conditions were reversed for 
three of these animals, only two re- 
versed their preference within the next 
1,024 trials, the third continuing to 
choose the same direction which was 
now the uncorrelated alley. Hence, 
there is some evidence for a positive 
aspect to an alternative in which the re- 
ward is conditional upon speed, but the 
effect is either not very large or not 
permanent. 


Outcome Conditional upon a 
Discrimination Problem 


A second way that the reward can be 
made conditional upon the rat's be- 
havior is to use a visual discrimination 
problem. This situation has already 
been described, it being one in which, 
if the rat goes in one direction in the 
maze shown in Figure 1, he encoun- 
ters a second choice point at which he 
must choose between a black and a 
White door, and if he goes in the other 
direction, he is forced through a grey 
door. F requency of reward is matcued 
in the two wings and hence familiar 
principles would lead one to expect the 
Tat to be indifferent between them. 

Black-white discrimination | versus 
matched frequency of reward. Four- 
teen rats have been run with a black- 
white discrimination problem in one 
wing of the block-eight maze, and their 
combined performance is shown in Fig- 
ure 5. There were wide individual dif- 
ferences in rate of learning the discrim- 
Ination problem but the graph never- 
theless correctly reflects the relevant 
features to be seen by examining all the 
individual curves, As the discrimina- 
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tion problem was initially solved, a 
preference for the nondiscrimination 
wing appeared, only two rats choos- 
ing the discrimination wing on more 
than half the last 160 trials. But 
there was not a clear reversal in pref- 
erence when the location of the dis- 
crimination problem was reversed. 
Hence, there is some evidence for a 
negative aspect to an alternative in 
which reward is conditional upon a 
visual discrimination, but the effect is 
either not very làrge or not permanent. 

Dark-light discrimination versus 
matched frequency of reward. The di- 
rection of the effect with a discrimina- 
tion problem was opposite to that ob- 
tained when reward was conditional 
upon running speed. There are sev- 
eral differences other than the nature 
of the task that might account for this 
discrepancy. Most obviously, there is 
a difference in difficulty because a rat 
on a correlated reinforcement condition 
in the alley initially gets reward very 
infrequently and never attains more 
than 75-80% correct, whereas a rat on 
a discrimination problem begins with 
50% reward and ultimately attains es- 
sentially perfect performance. 

In an attempt to explore this possi- 
bility, the discrimination problem in its 
reversed position was made more diffi- 
cult for eight of the rats by decreasing 
the difference between the brightnesses 
of the two doors, generally by converg- 
ing on an intermediate grey. This form 
of transposition was very good, the 
rats by-and-large continuing to choose 
the lighter or darker door depending 
upon the way they had originally been 
trained, but the difficulty level was in 
this way effectively increased as evi- 
denced by a decrease to about 85% suc- 
cess, There was no observable effect 
upon choice as between the discrimi- 
nation and nondiscrimination wings of 
the maze after 1,280 trials under these 
more difficult conditions, 59% of the 


choices in the last 160 trials being to 
the discrimination side. 

Black-white discrimination versus 
dark-light discrimination. As a fur- 
ther test of the effect of difficulty of 
the task, a discrimination problem was 
then placed in both wings of the maze, 
one being the simple black-white prob- 
lem and the other, a more difficult dis- 
crimination between two greys. 
Darker (or lighter) was correct on 
both sides. In order to avoid a pref- 
erence developing between these alter- 
natives because of a difference in re- 
ward based on a difference in percent- 
age success, the frequency of reward 
in the simple discrimination was re- 
stricted to the same level that the rat 
was obtaining on the more difficult 
problem. There was no evidence of a 
preference after 1,280 trials under 
these conditions. The rats all con- 
tinued to choose the direction they had 
been preferring during the previous 
phase of the experiment, which for 
some rats meant choosing the more dif- 
ficult wing and for others, choosing the 
easier wing. Hence, problem diffi- 
culty, at least for experienced animals 
with matched frequency of reward, is 
not a critical variable in conditional- 
outcome alternatives. 


Outcome Conditional upon a Position 
Response 


A third type of situation in which 
the general problem of conditional-out- 
come choices can be tested involves the 
same maze as used for visual discrimi- 
nation problems, but requires a position 
response at the second choice point. In 
this case, the swinging doors are all 
grey and in one wing of the maze, re- 
ward is conditional upon the rat going 
through (say) the left door. In the 
other wing of the maze, the turn is 
forced and reward frequency is 
matched to the percentage correct in 
the conditional alley. 
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One difficulty with this test of choice 
behavior when one alternative is con- 
ditional upon response selection is that 
the same type of response, namely. a 
position response, is also used to de- 
termine the rat's preference. If there 
is any anticipatory response tendency, 
it could contaminate the measure oí 
preference; e.g., a preference for a left 
turn at the first choice point might ap- 

pear because a left turn is correct at 
the second choice point rather than be- 
cause that direction leads to (or away 
from) the position problem.. A similar 
complication would have arisen in the 
previous cases had a stimulus discrimi- 
nation or response speed differentiation 
been used as the indicator response. 

For example, a rat could encounter a 

black-white discrimination problem at 

a second choice point if he chooses 

white at the first choice point and not 

if he chooses black there, in which c 

the differential reinforcement of 1 

ness at the second choice point might 

transfer to the first. It may be useful 
later to repeat these studies using dif- 
ferent indicator responses, 

However, there are several reasons 
for believing that this problem is prob- 
ably not serious in the present instance, 
There is evidence (Logan, 1961) that 
learning is quite specific to its location 
in this apparatus, and unpublished re- 
search by students failed to detect 
an anticipatory response tendency. 
Furthermore, detailed examination of 
the present data did not reveal any con- 
sistent relationship between prefer- 
ence at the first choice point and the 
correct turn at the second. 

Seven rats have been run on this 
type of problem and Figure 6 shows 
the results for four of them for whom 
location and correct position were 
counterbalanced. Only one rat showed 
a strong preference for the conditional- 
outcome alternative but the overall pic- 
ture leans somewhat in that direction. 
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rats that encountered a position problem in 
one wing of the block-eight maze and were 
given a matched Írequency of reward in the 
other wing. (Each break in the dashed 
curve indicates where the correct turn at the 
position problem was reversed.) 


Hence, there may be a slight positive 
aspect to an alternative in which re- 


ward is conditional upon a position 
response, 


Discrimination Problem versus 
Position Problem 

Since the evidence suggested the 
possibility of Something positive about 


à position problem and Something neg- 
ative about a discrin 


nination problem, 
when each was compared with an un- 
conditional Outcome, it seemed reason- 
able that these effects might appear 
more clearly if they were pitted against 
each Other. Toward this end, a black- 
white | discrimination problem was 


placed in one wing of the block-eight 
maze anda 


n Position problem was placed 
in the other wing. 

hese conditions could not be used 
from the beginning of training because 
the two wings would be differentially 
reinforced. To avoid this difficulty, the 
Tats were run through three phases 
Preceding the testing phase. The first 
Phase entailed Tunning for 864 trials 


m just one Wing to a discrimination 
problem or, for | 


half the rats, a position 
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problem. The second phase entailed 
running for 864 trials in the opposite 
wing to the other problem. Because 
the position problem could be learned 
more quickly than the discrimination 
problem, the rats were given several 
reversal series in the position wing so 
that they received about the same num- 
ber of reinforcements in each wing dur- 
ing the first two phases. The third 
phase entailed integrating the two 
problems, the rats being given an ir- 
regular series of 192 forced trials half 
in each wing with the appropriate 
problem in place. Four of the six rats 
made this transition to the integrated 
problem with only a few errors, but 
the other two had to be returned to a 
previous phase and then brought back 
to the integrated problem. 

Hence, by the time of the fourth 
phase during which the customary se- 
quence of free and forced trials was 
followed, the rats had solved both prob- 
lems, gone each direction equally often, 
received comparable reinforcement fre- 
quencies in each wing, and currently 
were performing essentially perfectly 
on both problems. There is thus no 
basis in familiar principles to predict a 
preference. 

None of the six rats showed a strong 
preference on the free trials during the 
fourth phase. Indeed, they were con- 
spicuously indifferent, overall percent- 
age choice of the position problem 
being 55%. Hence, there is no evi- 
dence that the nature of the problem 
makes a difference in conditional-out- 
come choices so long as frequency of 


reward is equated. 


Discussion 


The reader may wish to emphasize 
those few instances in which we have 
obtained preferences that do not appear 
to follow from familiar principles. 
However, it now seems likely that the 
deviations encountered were either 
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chance ones or the result of some arti- 
fact. Since the matching of rewards 
entailed a slight lag, any increase in 
frequency of reward appeared first in 
the conditional alternative and this dif- 
ference could be sufficient to produce 
a weak preference for conditions of 
correlated reinforcement. The tempo- 
rary preference away from a discrimi- 
nation problem might simply reflect 
the fact that, during one stage of learn- 
ing the rats had to pause to look back 
and forth at the discriminanda (see. 
e.g., Tolman, 1955), and the condi- 
tional alternative therefore took a mo- 
ment longer to run. Certainly, if a 
powerful new principle were operating 
in conditional-outcome choices, its ef- 
fect should have appeared more clearly 
in the various types of researches re- 
ported. Hence, the favored conclusion 
is that decisions in conditional-outcome 
choices can be understood on the basis 
of the same principles that govern non- 
conditional-outcome choices. 

Of course, additional research is 
needed to establish the generality of 
this conclusion, Other procedures, 
particularly ones in which the condi- 
tional aspect is very prominent, may be 
more likely to detect a preference, and 
other organisms, particularly primates, 
may be more likely to reveal one. 
Furthermore, although a micromolar 
theory (Logan, 1960) implies that 
choice is the fundamental measure of 
differential response tendency, it is im- 
portant to determine whether there are 
differences between the types of al- 
ternatives in measures such as resist- 
ance to extinction or in the effect upon 
them of variables such as drive. The 
results of these researches could devi- 
ate from the present findings but they 
would, at the same time, provide in- 
teresting comparative and methodo- 
logical evidence. 

Somewhat tentatively, then, it is con- 
cluded that there is no preference be- 
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7 conditional-outcome alterna- 
d one with the same frequency 
of reward uncorrelated with perform- 
ance. This conclusion in turn supports 
the thesis that behaviors can be aggre- 
gated into a single response class even 
though there may be differential rein- 
forcement among them.’ The selection 
of a response depends upon the out- 
comes actually being encountered with- 
out regard to the fact that those out- 
* comes may depend in some way upon 
how the subject performs subsequent 
to the choice. Accordingly, it is pro- 
posed that there are no restrictions 
within the conditions of reinforcement 
on the formation of response classes. 


SUMMARY 


A conditional-outcome choice is one 
in which the reward for at least one of 
the alternatives depends upon the or- 
ganism’s performance, A series of 


? This thesis is not antithetical to the mi- 
cromolar assumption that responses differing 
quantitatively are to be treated as different 
responses. The present Position is that one 
can aggregate responses of different speeds 
into a single class for the Purpose of predict- 
ing whether the rat will run down one alley 
or another, but the speed 


ier, b of running will not 
necessarily index the Strength of that re- 
sponse. When different responses are aggre- 


gated into a single class, whether they are 
differentially reinforced or Not, no statements 


can be made about differences among the 
responses within that class. 
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studies was reported in which rats 
were given a choice between two alleys. 
in one of which they had to run slowly. 
choose the correct brightness, or turn 
in the correct direction in order to ob- 
tain reward. It was concluded that 
rats are indifferent between such an al- 
ternative and one in which the same 
frequency of reward was given regard- 
less of performance. This finding was 
interpreted as indicating that distin- 
guishable behaviors can be aggregated 
into a single response class even though 
they are differentially reinforced. 
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PSYCHOLOGICAL REVIEW 


OBSERVATIONS ON ACTIVE TOUCH ' 


JAMES J. GIBSON 


Cornell University * 


Active touch ngiers to what is ordi- 
narily called toucMing. This ought t 
be distingushed from passive touch, 
or being touched. In one case the 
impression on the skin is brought 
about by the perceiver himself and in 
the other case by some outside agency. 
The difference is very important for 
the individual but it has not been em- 
phasized in sensory psychology, and 
Particularly not in the experimental 
literature. ; 
Active touch is an exploratory 
rather than a merely receptive sense. 
When a person touches something with 
his fingers he produces the stimula- 
tion as it were. More exactly, varia- 
tions in skin stimulation are caused 
by variation in his motor activity. 
What happens at his fingers depends 
on the movements that he makes— 
and also, of course, on the object that 
he touches. Such movements are not 
the ordinary kind usually thought of 
as responses. They do not modify the 
environment but only the stimuli com- 
ing from the environment. Presum- 
ably ‘they enhance some features of 
the potential stimulation and reduce 
Thev are exploratory instead 
f In this respect, 


others. 
of performatory. 


1 This research was supported in part by 
the Office of Naval Research under Con- 
tract NONR 401 (14) with Cornell Uni- 
versity. 


these touching movements of the fin- 
gers are like the movements of the 
eyes. In fact, active touch can be 
termed tactile scanning, by analog, 
with ocular scanning. 

By means of active touch a great 
many gproperties of the adjacent en- 
vironment can be perceived in the 
absence of vision. The blind must de- 
pend on it for most of their informa- 
tion about the world (Revesz, 1950). 
Nevertheless, despite its importance, 
the “sense of touch” (Boring, 1942, 
Ch. 13; Geldard, 1953, Ch. 9-12) has 
been studied by sensory physiologists 
only as a passive or receptive channel. 
It is treated as a part of cutaneous 
sensitivity. The sensitivity of the 
skin, which includes temperature and 
pain as well as touch, can be most 
easily investigated by the applying of 
stimuli to the cutaneous surface. At 
a perceptual level, Geldard (1957) has 
applied multiple stimuli to the skin and 
has thereby demonstrated some of the 
various “messages” that the skin will 
transmit but he is nevertheless con- 
cerned with a mosaic of receptors, not 
an exploratory organ. Katz (1925) 
and Revesz (1950) have argued, how- 
ever, that the hand is a kind of sense 
organ as distinguished from the skin 
of the hand. Katz began the descrip- 
tion of tactual experience as it occurs 
in life, and performed experiments on 
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of the remarkable discrimina- 
pone ssible. Revesz, ob- 
tions that are po blind 
serving the performances of the a , 
proposed an unrecognized mode of ex- 
perience called haptics which goes be- 
yond the classical modalities of touch 
and kinesthesis. His interest is in 
philosophy and esthetics. These two 
seem to be almost the only investiga- 
tors who have given much thought 
to what is here called active touch. 
Their work has not been followed up 
by experimenters and the term itself 
has never been widely used, perhaps 
because it does not fit with the estab- 
lished ideas of what constitutes a sense. 

The relation of touch to hinesthesis. 
It seems to have been assumed by psy- 
chologists that active touch is simply 
a blend of two modes of sensation, 
kinesthesis and touch proper.- That 
is, touching merely combines the data 
from the feeling of movement and the 
feeling of contact, the two sensations 
being fused into one experience. This 
assumption, however, 
First, it fails to tak 
purposive character of touching and, 
second, it fails to emphasize the multi- 
plicity of so-called kinesthesis, 

The first objection is that the act of 
touching or feeling is a search for 
stimulation or, more exactly, an effort 
to obtain the kind of stimulation which 
yields a perception of what is being 
touched. When one explores any- 
thing with his hand, the Y 


movements 
of the fingers are purposive. An 
organ of the body is being adjus 


ted 
for the registering of information. The 
limbs and extremities are, of course, 
motor organs as well as sensing or- 
gans, whereas the eyes are only sense 
organs, but the function of motor per- 
formance can be subordinated to the 
function of exploratory adjustment in 
the case of the limbs. 

Behavior theorists have recently 
been emphasizing the importance of 


can be criticized, 
€ account of the 
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what is variously called "feedback" 
from responses, or self-produced stimu- 
lation, or proprioception in general. 
They recognize that such inputs are 
necessary for the purposive control or 
steering of behavior. What they have 
not recognized is that some of the 
stimulation produced by responses 
yields objective information as well 
as subjective. The stimulation pro- 
duced by sense-organ adjustment is 
of this sort. In active touch, as will 
later be suggested, the flow of stimu- 
lation contains two components, one 
exterospecific and one propriospecific. 
But these components are not the 
traditional sensations of touch and 
kinesthesis respectively. The hy- 
pothesis will be advanced that the 
purpose of the exploratory movements 
of the hand is to isolate and enhance 
the component of stimulation. which 
specifies the Shape and other charac- 
teristics of the Object being touched. 

A second objection is that the term 
kinesthesis itself means different 
things and should not continue to 
imply the existence of a unitary sense. 
Historically the word refers to move- 
ment of the body, and originally meant 
the "muscle sense.” But it is known 
that there is sensitivity to the position 
of the body and of all its members 
relative to one another. This is an 
articular sense, not a muscle sense. 
and the joints yield information about 
joint position as well as joint rotation 
(Goldscheider, 1898). Probably there 
1S no “sense” of muscle contraction as 
such, the joint initiating the spatial 
information and the muscle spindles 
having only reflex coordinating func- 
tions (Rose & Mountcastle, 1959). 

ere is Sensitivity for the upright 
Posture of the head, the linear and 
angular accelerations of the head, and 
a general “sense” of the equilibrium 
9f the body. There is sensitivity of 
some kind for the force exerted by a” 
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individual, whether or not this is ac- 
companied by hypothetical "feelings of 
innervation" (Boring, 1942, Ch. 14). 
There is also, of course, visual sensi- 
tivity to the changes of position of 
one's body in space. And there is pro- 
prioception in general however this 
may be defined, as well as somaes- 
thesia. 

The single term “kinesthesis” can- 
not carry the burden of all the mean- 
ings that have been added to its origi- 
nal meaning. It fails to do justice to 
different inputs, to different combina- 
tions of these inputs, and to different 
functions of these combinations. 

Types and subtypes of anatomical 
receptors involved in active touch. 
Passive touch involves only the ex- 
citation of receptors in the skin and its 
underlying tissue, although the pat- 
terns may be elaborate. But active 
touch involves the concomitant excita- 
tion of receptors in the joints and 
tendons along with new and changing 
patterns in the skin. Moreover, when 
the hand is feeling an object, the 
movement or angle of each joint from 
the first phalanx of each finger up to 
the shoulder and the backbone makes 
its contribution. And these inputs 
occur relative to a continuous input 
from the vestibular organs, along with 
the cutaneous input from contact with 
the ground. Presumably the feeling 
of an object by the hand involves the 
feeling of the position of the fingers, 
hand, arm, body, and even the head 
relative to gravity, all being integrated 
in some hierarchy of positional infor- 
mation. 

The total flux of stimulation is enor- 
usly complex, but lawful modes of 


mo 
combination occur. Presumably the 
modes of combination of these inputs 


specify the difference between touch- 
ing and being touched. Also certain 
Special combinations probably carry 
information about the object being 


touched. In short, the so-called sense 
of touch as it is most often employed 
by using the hands involves the play 
of inputs coming from the whole 
skeleto-muscular system. Whether it 
is one sense or many is therefore a 
matter of dispute. Tt depends on what 
one wishes to call a sense. "There is 
no single organ or structure analogous 
to an ear or an eye. There is no 
unique quality of sensation analogous 
to warm, cold, pain, sour, sweet, or 
red. Active touch does not fulfill the 
supposed criteria for a single sense 
modality. Nevertheless it provides a 
quite definite channel of information 
about the external environment. It is 
a type of perception that is isolable 
from vision, audition, taste, and smell, 
and it needs to be studied in its own 
right. 

Mechanics of cutaneous stimulation, 
proximal and distal. The literal or 
proximal stimulus for touch, the im- 
mediate cause, is some sort of de- 
formation of the skin, not necessarily 
a simple depression of the surface as 
by a tactile “stimulator,” but almost 
any deformation of the elastic tissue. 
Pulling on a thread glued to the skin 
is just as effective a stimulus as push- 
ing it in. The stimulus has to be a 
change in time, that is, a motion of the 
skin. Since the work of Nafe and 
Wagoner (1941) it has been suspected 
that whenever the skin becomes abso- 
lutely motionless even in some abnor- 
mal conformation, the receptors cease 
firing and the stimulus becomes inef- 
fective. Outside of the laboratory this 
situation would be rare. A precisely 
constant pressure of an object relative 
to the skin would not occur, if only by 
reason of tremor or body sway. 

The cause of elastic change of the 
skin’s surface can be almost any me- 
chanical event. Some work must be 
done, even if slight, but the actual me- 
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i happenings are so various 
oF em that the textbooks E 
mechanics do not begin to descri s 
them. A common event is impact wit 
a solid substance, but a puff of air or 
a drop of water will do as well In 
hairy regions of the skin, even the 
slightest bending of a hair gives a 
stimulus. Some of the types of me- 
chanical events that stimulate the skin, 


in terms of everyday physics, may be 
exemplified as follows. 


1. Brief events: Pressure, push, slap, 
pat, tap, prick. Note that these vary 
in area as well as in duration, 

2. Prolonged events without dis- 
placement : vibration, stretching, 
kneading, pinching, 

3. Prolonged events w 
ment: Scratching, 
sliding, brushing, r 
these moving defor. 
degree of friction 


ith displace- 
scraping, rubbing, 
olling. Note that 
mations vary in the 
between the two 


surfaces, and also in the degree of 
lateral Stretching of the skin as well 
as the de 


gree of vertical depression, 


These are relatively 
events as compared to 
events that excite the 
organs imbedded in the 


the latter are also mechanically com- 


plex. The ways in which mechano- 
receptors are made to fire nervous 


impulses need not be considered here. 
The facts, so far as they are known, 
are to be found in the Physiological 
literature, 

The external causes of these me. 
chanical events, the Sources of poten. 
tial cutaneous stimulation, are the ob- 
jects and surfaces of the environment, 
"They are at the distal end of a sequence 
that leads to the excitation of nerve 
cells. An objective Surface of some 
kind and the cutaneous Surface must 
"touch," either by movement of an 
object with reference to the stationary 


large scale 
the 


tissue, 
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skin or by movement of the skin with 
reference to a stationary object. 
Contact with the substratum. The 
“sense of touch” is not ordinarily 
thought to include the feeling of cu- 
taneous contact with the earth, Never- 
theless, the upward pressure of the 
surface of support on some part of 
the body provides, for every terres- 
trial animal, a constant background of 
stimulation. It is covariant with the 
continuous input of the statocyst re- 
ceptors of the inner ear already men- 
tioned. Together they provide what 
the ordinary person calls the “sense 
of support." The axis of gravity and 
the plane of the ground provide the 
basic frame of reference for tactual 
Space perception, Active touch, as will 


appear, yields clear perceptions of en- 
vironmental Space in the absence of 
vision, 


The touchin 
neither active 
defined above, 
ing of a st 


& of the substratum is 
DOr passive touch as 
ie. neither the touch- 
ationary object nor being 
touched by a Moving object. It is 
instead a means of registering the 
stable environment with reference to 
Which both the movements of one's 
body and the motions of objects occur. 


OBSERVATIONS OF THE ACT OF 


Toucutinc ? 
In the sim 


ple experiments to be de- 
scribed, visi 


on was always excluded. 
accomplished by 
bserver put both 
h curtain at the 
A a table, not by blindfolding 
him, or working in darkness. His 
manual could thus be observed. 
ect of any sort is placed 
the observer or when 
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Susan Stanle 
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his hand is placed on it, the following 
facts can be noted. He tends to bring 
the other hand to the object if allowed 
to do so. He tends strongly to curve 
his fingers around the object if per- 
mitted. He tends to move his fingers 
over the object in complex ways. 
Some of these are: tracing movements 
with one fingertip or several fingertips, 
opposition of the thumb and other 
fingers, and rubbing, grasping, or 
pressing movements of the fingers. 
Finally, he tends to name the object 
if he can do so, or to compare it with 
a familiar object if he cannot. Non- 
sense objects are almost necessary for 
such observations since the observer 
does not persist after recognizing 
something. 

The observer seems to be trying to 
obtain mechanical events at the skin 
at various places in various combina- 
tions. The movements by which he 
obtains them do not ever seem to be 
twice the same, but they are not aim- 
less. If the hand is conceived as a 
sense organ, he seems to be adjusting 
it. He appears to be searching for 
stimulus information. 

So much for the manual activity. 
Reports of experience during active 
touch can also be obtained. They 
tend to be radically different from 
reports of experience during passive 
touch. The following facts, noted by 
the writer, are confirmed by other 
observers in this experiment, or are 
consistent with earlier observations. 

Unity of the phenomenal object. 
When feeling a single object with two 
fingers, only one object is perceived 
although two separated cutaneous 
pressures occur. „The separate “local 
signs” are not noticed. This holds true 
not only for finger-thumb opposition 
but also for other combinations of 
fingers. The unitary perception oc- 
curs when all five fingers are applied 
to an object, and even when two sepa- 


rate hands are applied to the object. 
In fact, 10 different digital pressures 
all at the same time yield a wholly 
unified experience. There are some 
limitations on this unity, as can be 
observed with Aristotle’s illusion when 
a pencil is held between crossed fing- 
ers, but even then it is very hard to 
overcome, by the writer's introspec- 
tion. This fact needs further investi- 
gation. 

Stability of the phenomenal object. 
When sliding the skin over a corner 
or protuberance of an object, the dis- 
placement of the cutaneous pressure, 
the “tactile motion,” cannot usually be 
noticed. The object seems to remain 
stationary even though the impression 
moves relative to the skin. Not only 
the object but also the whole space 
of the table, chair, floor, and room is 
felt as perfectly stable. This phenome- 
non has been studied in an experiment 
reported in the next section. 

Rigidity or plasticity of the phe- 
nomenal object. When pressing a 
finger on a rigid surface or squeezing 
an object with the hand, it is difficult 
to notice the increase of intensity of 
cutaneous sensation; instead the ob- 
server is primarily aware of the sub- 
stance and its resistance. Likewise. 
when pressing or squeezing a nonrigid 
object (a lump of modeling clay, a 
rubber ball, a handful of cloth) the 
observer is aware of the yieldingness, 
elasticity, or softness of the substance, 
not of the (very different) intensity 
of the back pressure on the skin. This 
fact deserves further study. Presum- 
ably the degree of force exerted is 
heing registered by the joint-and-ten- 
don receptors and the resulting degree 
of skin-pressure is being registered by 
the cutaneous and deep-tissue recep- 
tors, but the intensities are not experi- 
enced as such. Perhaps it is the rela- 
tion between these different intensities 
that is registered. The proportion of 
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the same retinal stimulus. ares 
the problem is analogous in tactua 
tion, although the solution can 
alee carats dier When an 
hardly be quite the same. V 
individual actively moves his hand 
relative to his body, and thus relative 
to gravity and the surface of support, 
he is in contact with the earth as well 
as with the object, whether it be mov- 
ing or motionless. A receptive sys- 
tem of higher order is excited during 
activity than the system during passiv- 
ity. Hence the stimuli which have one 
excitatory capacity for a receptive sub- 
system in passive touch will have a 
different excitatory capacity in active 
touch—a different specificity. The 
relations between stimulation of the 
subsystems become significant. A law. 
ful covariation of the inputs carries 
information different from that carried 
by isolation of the inputs, and yields 
different perceptions. Johannes Mül- 
ler’s law of the specific conscious qual- 
ity of each excited nerve is evidently 
much too simple, 
To lump one set 
tems together as touch and another as 
hinesthesis, then, is to Obscure the 


function of the Systems in combination, 
If both had their own unique quality of 
experience there w 


ould be some justi- 
fication for the traditional belief in 
separate senses, but they do not. Ana- 
tomically different receptors may serye 


the same function and arouse the same 
experience, and, moreover, may serve 
different functions in different combi- 
nations. It might be better to drop 
the term sense and Speak instead of 
“esthesic systems.” 

It should be noted that nothing 
has been asserted about performatory 
movements of the hand, as when an 
object is relocated or a tool is wielded, 
The perception is then subsidiary to 
action and the receptive systems take 
on still another function not here con- 
sidered. 


of receptive sys- 
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PERCEPTION Or THE TEXTURE, SUB- 
STANCE, AND SHAPE OF AN 
Oxyecr nv Toucn 


Katz (1925) showed that the tex- 
ture of a solid substance could be per- 
ceived by touch if there was relative 
motion between the surface and the 
skin. Ordinarily this means rubbing 
the fingers over the surface; perpen- 
dicular motion was very much less 
effective than lateral motion. A sub- 
stance could be identified by its tex- 
ture; the observers could distinguish 
a dozen kinds of paper including blot- 
ting paper, wrapping paper, and writ- 
ing paper. Katz also demonstrated 
the perceptibility of other physical 
Properties of a surface besides rough- 
smooth, including soft-hard, and vari- 
ous kinds of plasticity, elasticity, or 
viscosity, 

Katz was primarily interested in the 
fine structure of the surface of an 
object, and the substance of which it 
was composed. He had less to say 
about the shape of a substance, which 
is perhaps the main characteristic of 
an object. Active touch, however, is 
an excellent channel of spatial infor- 
mation in that the arrangement of sur- 
faces is readily picked up. The solid 
geometry of things is best got by feel- 
ing them. A number of observations 
illustrate this assertion, 

Curvature as against planarity of 4 
Surface, Departure of a surface from 
the Plane can be judged by an observer 
either by olding his hand against 
the Surface or by running his fingers 
Over it. The Opposite qualities of con- 
Vexity and concavity are immediately 
given. na smaller scale, protuber- 
ances or indentations on a plane surt- 
face are identified, and this acuity 15 
taken advantage of in Braille. More- 
Over a surface with positive curvature 
9n one axis and negative curvature on 
the other is Perceived as saddle-shaped. 
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The accuracy of all these judgments 
could be but has not been measured. 

Slant of a surface with respect to 
gravity. When the observer's hand 
is placed on an adjustable palm-board 
outside the field of view, it is found 
that he can judge its slant with pre- 
cision. He can nullify this slant and 
adjust it to a normal plane. This 
holds for all three planes of space, the 
frontal, the saggital, and the hori- 
zontal. Variable and constant errors 
for settings to the normal plane seldom 
exceed 2?-3?. Furthermore, the slant 
of a surface that is visible only can 
be equated to the slant of a surface 
that is tangible only. The cross-modal 
matching of slant is accepted as nat- 
ural by all observers, and the manual 
judgment can be used as a criterion 
of visual perception. (Gibson, 1950). 

When the palm-board is maintained 
at a slant from the frontal upright 
plane for some time, "normalization" 
seems to occur, and a negative after- 
effect ensues. In this respect, the 
tangible slant of a surface behaves like 
the visible slant of a surface (Berg- 
man & Gibson, 1959). 

Parallelity of two surfaces. If a 
pair of adjustable palm-boards is held 
between the two hands, behind a cur- 
tain, they can be adjusted to the paral- 
lel position with small errors. The 
divergence or convergence of the two 
planes are felt as opposite qualities of 
a simple spatial experience. This di- 
mension is also susceptible to normal- 
ization; after a few minutes of feeling 
divergence (say) the setting accepted 
as parallel is found to have shifted 
several angular degrees toward diver- 
gence. This fact has been studied in 
detail (Gibson & Backlund, 1963). 

Distance between two surfaces. The 
span of the hands holding an object 
between them is experienced vividly. 
The tangible extent can be matched 


with a visible extent or vice versa. 
Likewise the distance between the 
thumb and one or more of the opposa- 
ble fingers is perceived, and is accu- 
rately matched with the width of a 
visible object. According to Kelvin 
(1954) the cross-modal size compari- 
son is made with no less accuracy than 
either unimodal size comparison alone. 
The finger-span of one hand can be 
immediately felt as equal or unequal 
to the finger-span of the other hand, 
and this comparison has been utilized 
by Kóhler and Dinnerstein (1949) in 
studying a phenomenon they termed 
the kinesthetic figural aftereffect. In 
the many repetitions of this experi- 
ment the equivalence of the two hands 
for perceiving a spatial extent seems 
to have been taken for granted without 
its being studied in its own right. 

Plane-angle, or edge of a surface. 
To exploratory touch, a junction of 
two planes constituting an edge is very 
noticeable. The angle is perceptible 
as varying from sharp to dull. At the 
extreme case of a knife-edge, sharp- 
ness is more accurately judged by 
touch than by vision. 

Edge-angle, or corner of a surface. 
Where two edges intersect, or where 
three or more planes adjoin, a corner 
is produced. In the case when the 
angles of a convex corner are acute 
it is apt to be called a point. Corners 
and edges seem to be the main identi- 
fying feature of many objects. The 
fingers appear to seek them out and 
explore them, 

Discussion. Planarity, curvature, 
slant, parallelity, span, edge, and cor- 
ner may be conceived as variables of 
solid geometry. They are all percep- 
tible to active touch, and appear to be 
simple qualities of experience. Since 
they are amenable to physical meas- 
urement, psychophysical experiments 
would be possible, but few have been 
performed. 
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TactuaL Form PERCEPTION: 
A Comparison or Active 
AND Passive Toucu 


The study of the perception of ob- 
jects by touch has heen largely con- 
fined to investigations of two dimen- 
sional form or pattern on the skin, 
with passive stimulation (eg., Zigler 
& Barrett, 1927), They are in the 
tradition of experiments on cutaneous 
localization and the two point thresh- 


old, the assumption being that the skin 
is first of all a rec 


were used; the 
discrimination, 
forms in this m 


Everyone knows, however, that a 
better way to Perceive a thing of this 
sort is to run one’s fingers over the 
edges. The fact is often taught to be. 
ginning students, and is even demon- 
strated in laboratory exercises (Lang- 
feld & Allport, 1916) but a Search of 
the experimental literature failed to 
locate its source. Lashley (1951), for 
example, asserts as if it were common 
knowledge that the same shape which 
can scarcely be distinguished with 
passive touch “can readily be distin. 
guished when explored by tactile Scan- 
ning" (p. 128). “An experiment was 
performed to quantify this assertion, 
since no authority for it could be 
found. Each of a set of tactual forms 
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The six forms to be identified by touch. 


had to be matched to its visual equiva- 
lent. The accuracy with which this 
could be done was compared when the 
form was pressed into the palm of the 
hand and when it was held above the 
palm and explored by the fingertips. 


Method 


The sources of stimulation were bent 
metal strips with handles, the edges having 
the forms shown in Figure l. They were 
in fact small "cookie cutters" with a mean 
diameter of 1 inch. The six used were 
Chosen for being mutually different in about 
the same d No two had the same 
or points. Each subject 
of stimulators and the 


He then 
up on the table behind 
S either touched by the 
Wed to touch it for sev- 
In the one condition, he was 


instructed to open and relax his hand; in the 


other to cup 


Pare to feel the edges He then had to 
identify the form by number. The stimu- 
ator w 


as not necessarily presented in the 
S the drawing, Fe 
Were presented five times 


: The order was 

9 preliminary practice or knowl- 
edge of results Was given. Twenty subjects 
Were tested, 


Results, A chance level of judg- 
ments would 


In the active Condition, the subject 
Was permitted to explore the edges im 
any manner he chose. One style was 
to use all five fingertips simultaneously, 
another was to Move the tip of the 
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index finger around the edge, and 
others were intermediate. All sub- 
jects, however, actively moved their 
fingers and thus obtained relative mo- 
tions between the skin and the object. 

In the passive condition, the form 
was pressed down on the palm by 
manual action of the experimenters so 
that a certain amount of unsteadiness 
was inevitable. When subsequently, 
with 20 different subjects, a mechani- 
cal lever system was used to apply the 
stimulation the mean frequency of cor- 
rect matches fell to 29%. 

Some subjects reported that they 
tried to count the number of corners 
or points they could feel, and that this 
was easier with the active than with 
the passive condition. 

Discussion. A common belief has 
been verified but, in a sense, no claim 
can be made for the conclusiveness of 
the comparison. It could be argued 
that the fingertips are more "sensitive" 
than the palm of the hand, meaning 
that localization is better and the two- 
point threshold lower. This is true. 
The comparison is not very meaning- 
ful. The real point of the experiment 
is that tactual form perception in one 
meaning of that term does not depend 
on the pattern of local signs on the 
skin. If any number of fingertips 
from one to five can be used indiffer- 
ently, the chaotic pattern of local signs 
resulting can hardly be the sole basis 
of the perception. With active touch 
no forms existed on the skin, but only 
a changing pattern of pressures. Evi- 
dently the registering of "form" by 
touch is not as straightforward a proc- 
ess as experimenters have supposed. 
Another experiment may help to clar- 


ify the matter. 


TRANSPOSABILITY OF FORM ON 
THE SKIN 


Gestalt theorists have emphasized 
the fact that a form can be moved over 
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the retina, completely changing the set 
of receptors stimulated, without chang- 
ing the perception. The contribution 
to perception of the anatomical pattern 
of excitation becomes a puzzle. The 
same is true of the skin. 

Along with the experiment just de- 
scribed, another was carried out at the 
same time in order to compare passive 
static form perception with passive 
moving form perception, In these 
trials the “cookie cutter,” instead of 
simply being pressed into the palm of 
the hand, was pressed and continually 
rotated clockwise and counterclock- 
wise with a twisting motion. The 
same area of skin was stimulated in 
hoth cases and the same anatomical 
receptors were available, but in the 
latter case the form of stimulation 
changed from one instant to the next. 

Judgments of the form of the object 
were more accurate when it was ro- 
tated than when it was not, the mean 
frequencies of correct matches being 
72% and 49%, respectively. The dif- 
ference is significant at the 2% level. 
To the writer’s introspection, the cor- 
ners of the object seemed to “stand 
out” better when it was twisted. In 
short, the form of the object seemed 
to become clear when the form of the 
skin-deformation was most unclear. 

Discussion, Continuous change of 
the proximal stimulus with no change 
of the rigid object constituting the 
source of the stimulus is an old prob- 
lem in visual perception. Why does 
the perception correspond to the form 
of the object instead of to the form of 
the stimulus? Evidently a quite simi- 
lar problem arises for tactual percep- 
tion. The paradox is even more strik- 
ing, for tactual perception corresponds 
well to the form of the object when 
the stimulus is almost formless, and 
less well when the stimulus is a stable 
representation. of the form of the 
object. A clear unchanging percep- 
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tion arises when the don of sense im- 
essions changes most. 

Pe anii be that the skin does not 
have as its primary function the ue 
tering of form as this has usually pet 
conceived. Impressions of form an 
location relative to the skin might be 
quite separate from and incidental to 
the use of the extremities as sense 
organs. The informative stimuli might 
well be incorporated in the seemingly 
complex motions and transformations 
of the skin. There must exist relations 
of separation and proportion which 
remain invariant over time, and which 
are specific to the object, They do 
not represent the shape of the object 
but they specify it. The solution to 
the problem of Object perception in 
touch would then be that continuous 
change in the proximal stimulation is 
accompanied by nonchange, that is, the 
Set of invariant relations. The former 
is not noticed; the latter is separated 
out and attended to, 
The role of exploratory finger move- 
ments in active touch would then be to 
isolate the invariants, that is, to dis- 
cover the exterospecific component in 
the flux of stimulation, Only thus 
could the above paradox be resolved, 


RELATION BETWEEN Toucu 
AND VISION 


A number of similarities and differ- 
ences have been noted in the course 
of these observations between active 
touch and visual perception. Many of 
these have been pointed out before by 
other observers such as Katz ( 1925, 
1935) but a few of them have not pre- 
viously been emphasized, 

Surface perception. Solid surfaces 
are tangible and visible. This can 
only be because they have mechanical 
inertia in the first place, and because 
they reflect light with scattering in the 
second place. (A few visible surfaces 
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such as smoke are not tangible and a 
few tangible surfaces such as plate 
glass may not be visible, but most sur- 
faces are both.) The physical texture 
of a surface, the various kinds of 
roughness, can be observed either by 
vision or by touch. The color of a 
surface is visible only, not tangible, 
and consequently the pigment-texture 
of a surface (the differential coloring 
of marble or of a picture) is the same. 
On the other hand, if color is intangi- 
ble temperature is invisible. Each 
sense has its special sensitivities to the 
properties of a surface but they also 
have some in common. The bark of 
a tree looks rough without touching 
it and feels rough without seeing it. 
The expert can identify the tree by 
either sense alone, 

The visible array of surfaces ex- 
tends for miles of course, and the tan- 
gible array only to the length of man’s 
arm. But if surfaces determine space 
perception, both touch and vision are 
Spatial senses, 

Object perception, A 
substance with a topol 
surface is perceptible 
the hands and the eyes. By virtue of 
Physics, the eyes can encompass the 
closed contour of a very large object 
(depending On its distance) while the 
hands can explore only the surface of 
an object of limited Span. The unity 
of the visual Perception is said to be 
based on the "figure-ground" phe- 
nomenon, the simultaneous registering 
of the whole Contour, whereas the 
‘nity of the tactual perception has to 
be based on either cutaneously sepa- 
rated impressions Or on successive 
Impressions. Vision is Said to register 
Only the front Surface of an object 
While touch Tegisters the front and 
the back at the same time, In these 
respects vision and touch seem to dif- 
S. Nevertheless these 
exaggerated ; Succession 


detached solid 
ogically closed 
as such to both 


contrasts are 
enters into 
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the operation of both senses. The eyes 
normally fixate in succession just as 
the fingers explore in succession. With 
two eyes, and by changing one's stand- 
point, more of an object than its front 
surface is perceived. As regards the 
unity of separated impressions from 
the two hands, there is a visual anal- 
ogy in the unity of separated impres- 
sions from the two retinas. 

Spatial properties of objects. Cer- 
tain variables of the shape of a de- 
tached object, slant, curvature, edge 
and corner are independently visible 
and tangible. The equivalence of the 
two modes of perception for judg- 
ments of the object is such that dif- 
ferences got by one sense are equated 
to differences got by the other. The 
stimulus information for these cross- 
modal judgments, one depending on 
mechanical and the other on luminous 
energy, might seem hopelessly discrep- 
ant, but the fact is that the judgments 
can be made. 

Whether both vision and touch can 
register form is an ambiguous ques- 
tion, as the experiments showed. The 
answer depends on what is meant by 
form. Drawings and pictures on a 
flat surface are sensed only by vision. 
The conformation of the edges of an 
object, however, is sensed by both 
vision and active touch. The tan- 
gible and visible features of things 
abstracted by solid geometry (slant, 
curvature, edge, corner) did not seem 
any more complex to introspection 
than the solely visible features of 
things abstracted by plane geometry 
(triangle, square, circle). 

A new set of shapes has now been 
devised for the study of object-percep- 
tion by active touch. They consist of 
ten solid sculptures, Or free-forms, 
made of plastic, the surfaces being 
curved, with no planes, edges, or cor- 
ners, They are intended to be felt 
with two hands (and are called *feel- 


ies"). Approximately one-half of each 
surface (the "rear") is convex; the 
other half (the "front") consists of 
six convexities with intermediate sad- 
dles or concavities. In general, there 
are five protuberances around a cen- 
tral protuberance, but no object is 
symmetrical, either radially or bi- 
laterally. They cannot, therefore, be 
distinguished from one another by 
counting. Each is readily discrimi- 
nated from every other by vision of 
the "front" sur They are also 
mutually distinguishable by feeling, al- 
though with some error and hesitation 
for an unpracticed observer. 

Replicas of the ten objects are avail- 
able, made from the same molds. It 
is therefore possible to present one 
object to the hands and the same or a 
different object to the eyes simultane- 
ously. Preliminary results indicate 
that the cross-modal matching of these 
novel preceptions is possible even for 
a naive observer, and that practice can 
bring about errorless judgments in all 
observers so far tested. 

Passivity and activity in touch and 
vision. In passive touch the individual 
makes no voluntary movements. Simi- 
larly, in passive vision he makes no 
eye movements, which means that he 
must voluntarily fixate his eyes on a 
point specified by the experimenter. 
Neither state is natural to an individ- 
ual. In a tactual situation, the ob- 
server will explore with his fingers 
unless prevented and, in a visual situ- 
ation, he will explore the focussable 
light, fixating, accommodating, con- 
verging and pursuing. Both senses 
are normally active. The passive stim- 
ulation of the skin or the retina is 
necessary for the study of the receptor- 
cells in the skin or the retina, but the 
experiences resulting are atypical. 

[n active touching and looking the 
observer reports experiences of a quite 
different order. They correspond to 
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the environment instead of to „the 
events at the sensory surface. The 
experiences noted with passive stimu- 
lation can scarcely be noticed, if at 
all. The effective stimuli are presum- 
ably different, being relations and 
combinations selected from a flux of 
potential stimuli. Moreover, the ex- 
periences of active touching and look- 
ing are more nearly equivalent ; they 
are more alike to introspection than 
those of passive touch and vision. 

In general, the foregoing survey 
suggests that vision and touch have 
nothing in common only when they are 
conceived as channels for pure and 
meaningless sensory data, When they 
are conceived instead as channels for 
information-pickup, having active and 
exploratory sense organs, they have 
much in common. In some respects 
they seem to register the same infor- 
mation and to yield the same phe- 
nomenal experiences. 


CONCLUSIONS AND SUMMARY 


A series of observations, both intro- 
spective and behavioral, confirms the 
distinction between touching and being 
touched. The former is a channel for 
a great variety of information about 
the environment, but whether it should 
be considered one or several senses is 
a matter of definition. The simple 
formula that it consists of passive 
touch plus kinesthesis is insufficient. 
The hypothesis of two components of 
stimulation, one exterospecific and one 
propriospecific, is more promising. 

Many properties of surfaces and ob- 
jects can be registered by active touch. 
The accuracy of such judgments can 
be measured and formal experiments 
are needed. The observations suggest 
that the variables of solid geometry 
can be more profitably studied than 

those of form. Object constancy is 


characteristic of tactual perception. In 
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several respects, tactual 
perception are alike. 

In general, experimenters have not 
realized that to apply a stimulus to an 
Observer is not the same as for an 
observer to obtain a stimulus. Obtain- 
able stimuli can be controlled and sys- 
tematically varied, just as applied 
stimuli can be.  Psychophysical ex- 
periments are possible for tactual 
perception as well as for cutaneous 
sensitivity if appropriate methods are 


devised for controlling the obtainable 
stimuli, 


and visual 
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A SYSTEM FOR SHAPE RECOGNITION 
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In this paper it is suggested how cer- 
tain morphological features of the his- 
tology of the visual system may on sim- 
ple assumptions function to discrimi- 
nate distance, angle, and shape. For 
the purposes of this paper shapes can 
be regarded as a derivative of distance 
between points on the contours and 
also as a derivative of the angles be- 
tween parts of the contour. It is diff- 
cult to know whether in the human 
case it is the relative distance cues or 
the cues from angles which are more 
relied upon. In the case of a visually 
more primitive organism, such as the 
rat, though it is known that it can dis- 
criminate distance and also relative 
size, its abilities to discriminate the 
angle of one line to another or the cur- 
vature of lines remains untested. Tt is 
therefore entirely possible that such 
animals lack a mechanism to enable 
them to discriminate angles. If this is 
the case the shape discriminatory abil- 
ities of such organisms are based 
mainly on comparisons of relative dis- 
tance between contours, 

It would seem likely that in discrimi- 
nating shape animals utilize prior infor- 
mation available in the nervous system 
about distances between contours and 
about angles between contours also. 
However, the basis of the capacity to 
estimate distance between any two 
points in the visual field or the angle 
between any two contours in the visual 
field is obscure. If we wish mechani- 
cally to assess the relation between two 
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things or events such as the distance 
between two points, it is necessary 
(though not sufficient) that these two 
things or events jointly determine some 
third event. 

On the face of it, a system of fixed 
connections would appear to be very 
uneconomical in the number of struc- 
tural elements which would have to be 
employed to assess distance (Suther- 
land, 1959), Suggestions have there- 
fore been made to decrease the number 
of elements required to relate visual 
points or contours so that they can 
jointly determine some third event. 
One such way is to transform distance 
in space into distance in time by some 
kind of moving process, a notion famil- 
iar from television, One such sugges- 
tion of scanning from fixed coordinates 
external to the shape to be analyzed 
has come from Dodwell (1957) for 
the rat and Deutsch (1960) for the 
octopus. Deutsch (1955, 1958, 1960b) 
has suggested that such a conversion of 
distance into time is effected not by a 
Scan external to the shape to be ana- 
lyzed but by a wave of excitation prop- 
agated by the contours of the shape 
itself, 

However, further investigation of 
the problem has led the writer to the 
conclusion that a system relying on a 
conversion of Spatial separation into 
temporal interval is not necessary. For 
instance an examination of the neuro- 
histological data concerning the zona 
plexiforme of the deep retina in the 
octopus has led him to postulate a sys- 
tem for shape recognition which does 
not rely on these principles (Deutsch. 
19602), though no functional fixity of 
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connections has been postulated. 
There is good correspondence between 
this system and what can be seen of the 
striking morphology of the zona plexi- 
forme and the properties of the shape 
discriminating system of the octopus as 
they have been ascertained so far. 
However, the shape discriminatory ca- 
pacities of the octopus are different 
from those found among the verte- 
brates. It seems nevertheless that even 
in other cases besides the octopus it is 
possible to devise a system of reason- 
able economy in which the principle 
of transforming distance into time is 
not necessary. 


SPECIFICATIONS FOR A BIOLOGICAL 
SHAPE RECOGNIZING MECHANISM 


In a previous paper on the subject 
(Deutsch, 1955) the present author 
enumerated some of the peculiarities of 
shape recognition in animals which a 
theory should explain. These peculi- 
arities are not common to all animals, 
such as the octopus, and here special 
theories are necessary (such as 
Deutsch, 1960), but these peculiarities 
appear to be those of the general case 
of higher vertebrates. (Unfortunately 
the experimental evidence in this field 
leaves much to be desired. But if it is 
argued that the properties set down 
below have not been established experi- 
mentally to be the case, then we can re- 
gard the properties listed below as 
predictions instead from the theory 
from which they are deducible.) 

These are some of the abilities that 
a system must have before it can be 
considered a theory of shape recog- 
nition. 

1. It must be able to produce the 
same output independent of the loca- 
tion of the image on the receptor sur- 
face. Shapes retain their identity in 
spite of changes in their position in the 
visual field. 
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2. The system must produce the 
same output independent of the rota- 
tion of the shape on the receptor sur- 
face. (For some of the evidence on this 
point see Gellermann, 1933, and 
Deutsch, 1960b). 

3. A similar output should be given 
for the same geometrical shape of dif- 
ferent sizes, but in such a way that 
transfer to a larger shape is more effi- 
cient than transfer to a smaller shape. 

4. Mirror images should produce an 
identical output. They cause great dif- 
ficulty in discrimination in animals and 
children and are readily confused even 
by adults. 

It should at this stage be emphasized 
that only a partial set of visual classify- 
ing mechanisms is being suggested. 
The fact that we can perform visual 
discrimination of the tilt, position, or 
brightness does not constitute a dis- 
proof of a theory of distance, angle. 
and shape recognition. It is clear that 
the mechanism which classifies such 
properties per se must be discarding 
information about tilt, position, or size. 
It must be an abstracting mechanism 
par excellence. The fact that an ani- 
mal can visually discriminate other 
properties besides these simply means 
that it is also possessed of other visual 
mechanisms, Therefore, if an animal 
can be shown to discriminate between 
two stimuli this does not disprove the 
notion that a particular system is not 
to be found inside the animal; it could 
be supplemented by another mecha- 
nism. On the other hand an inability 
to discriminate, which would be pre- 
dicted by the theory, would tend to 
support it as would a tendency to trans- 
fer to other stimuli, identical on the 
theory, even where discrimination can 
be taught. 

Let us begin by examining a neuro- 
histological picture (Figure 1) of the 
bee's optic lobes (from Cajal & San- 
chez, 1915) which gives a structure 
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Fic. l. Neurohistological picture of the 
bee's optic lobes. 


well suited for the transformations to 
be postulated. 


As all the authors have recognised, the 
plexiform mass of this lobe appears to be 
concentrically furrowed, by means of con- 
centric bands, of a finely granulated aspect, 
which appears to Radl to constitute the ana- 
tomical characteristic of the visual centres 
of the animal series, and which remind not a 
little of the plexiform layers of the optic 
lobes of the fishes and batrachians, so mi- 
nutely described by P. Ramon (pp. 99-100, 
author's translation) 


Such a structure is not unique to the 
bee, though the particular section here 
is unusually good for the purpose of il- 
lustration. It is to be Observed that 
the lobe is divided by various hori- 
zontal layers, three of which are indi- 
cated (at g, p, and i). The retina is 
not in the section, but fibers from it 
are shown proceeding from the direc- 
tion of A (upper left-hand corner) 
which enter the lobe at right angles to 
the horizontal layers after crossing at 
the chiasma. It can be seen that in 
the first layer to be reached by the 
retinal fibers the number of termina- 
tions is large, and the size of the termi- 
nal arborizations is small hy compari- 
son with those further inside the lobe. 
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From this it is possible to infer that in 
this layer is preserved information 
about the activity of a larger number of 
ommatidia than in layers further in and 
also that the ommatidia which are rep- 
resented in this layer are closer neigh- 
bors to each other spatially in the eve 
than those represented by fibers which 
terminate on subsequent layers, The 
fibers proceeding from the retina en- 
tering further layers are fewer, pre- 
sumably from points further apart on 
the retina, and their terminal ar- 
borizations are larger than in the pre- 
vious layers, thus giving scope for 
interaction over a wider angle. So it 
seems that as we go inwards from the 
retina, progressively fewer fibers cross 
cach horizontal layer and even though 
the fibers are converging (because of 
the concentric arrangement of the lay- 
ers) the extent of the terminal arbori- 
zation at each level progressively 
increases, 

Given such an anatomical arrange- 
ment, we need to make some assump- 
tions about how the parts behave in 
order to speculate about the functions 
of this arrangement. The only as- 
sumption which is necessary is that the 
fibers in each horizontal layer act as 
spatial summators, We need only 
postulate that if two incoming retinal 
fibers which are spatially contiguous 
are also simultaneously excited, then 
this will excite fibers in the horizontal 
layer. The sum of the excitation gen- 
erated in this manner in each layer will 
reflect the number of neighboring fibers 
that are simultaneously active. R 

As the fibers which are neighbors 10 
the various layers are drawn from an- 
gular distances progressively further 
apart, there will be distinctive pat- 
terns of activity emerging from the 
horizontal layers depending on tho 
Overall distribution of widths anc 
breadths of visual areas. A totally 
uniform excitation of all the visual cle- 
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ments will cause maximum activity in 
all the horizontal layers. A visual field 
composed of contrasting areas each of 
which spans that angle of field from 
which one receptor is selected to end in 
a particular horizontal layer will pro- 
duce a minimum of activity in this 
layer. In such a case, alternate fibers 
ending in this layer will be inactive, 
and so spatial summation will not oc- 
cur. As we inspect the output from 
horizontal layers which sample closer 
neighborhoods, the rate of activity in 
these will increase as we move away 
from the layer in which activity is at a 
minimum, The same will occur as we 
move in the other direction where fi- 
bers drawn from more distant points 
end in horizontal layers further down. 

The system so far described will per- 
form a kind of crude autocorrellogram 
of an area with itself, indicating the 
number of times areas of a given size 
repeat regardless of direction. Each 
horizontal layer performs this task for 
an area of one approximate size. 
Therefore, a uniform excitation of the 
whole visual field should produce a 
maximum of excitation on all layers. 
On the other hand, a checkerboard with 
squares of a certain size should cause 
great lowering of excitation in one par- 
ticular horizontal layer, as the repeti- 
tion frequency (taking all directions to- 
gether) of an excited area will of neces- 
sity be small. It can be seen that the 
system described for the bee could not 
perform the type of discriminations to 
be seen in the rat, for instance. How- 
ever, we have supposed, in order to be 
economical in our assumptions, that it 
samples receptors from gradually in- 
creasing distances at each layer. It is 
evident that such a system would per- 
form very poorly when confronted 
with outline shapes and best when 
dealing with regularly repeated blobs 
or stripes. This property tallies with 
what is known of the bee's visual abil- 


ities. It is not supposed that the verte- 
brate system is of precisely the same 
type. The example of the bee is 
quoted in order to show that the kind 
of structure necessary to perform our 
postulated transformation is a biologi- 
cal possibility. This structure can 
without much difficulty be assigned the 
kind of task which we require. 

The higher vertebrates appear to 
take contours as the basis of their shape 
discrimination. Therefore, neighbor- 
ing arborizations of fibers at successive 
layers should be made to send a pulse 
to the horizontal layers only when both 
of them are active and a fiber should 
only be made active if it has been 
stimulated by a contour discontinuity. 
Further, fibers descending to progres- 
sively lower layers cannot merely be 
samples of retinal stimulation and pro- 
gressively larger distances, but must 
give information about the presence of 
a contour at within these distances. 
Given these alterations, we have a sys- 
tem which is capable of discriminating 
distances. A particular distance be- 
tween two retinal points will be repre- 
sented as activity in a certain horizon- 
tal layer. As a result it can be used as 
the basis for a certain kind of system 
of shape discrimination. 

System. The basic function the 
postulated system performs is to meas- 
ure the distance between all pairs of 
points on the boundary of the shape 
and so produce a distribution of how 
many pairs of points there are at each 
distance from each other. This may 
be made clearer by an example. If we 
take four points at the corners of a 
square, we can then measure the dis- 
tances between all the pairs of points. 
The histogram representing this figure 
will show a total of four at the distance 
equal to one of the sides of the square 
and a total of two at the distance equal 
to the length of the diagonals. We can 
perform a similar transformation of a 
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Fic. 2. Histogram measuring the distance 
between all the pairs of points. 
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shape given any number of points on 
its boundary to derive the curves 
shown in the illustration (Figure 2). 
As can be seen, different shapes under 
Such a transformation still remain dif- 
ferent from each other. 

Given that any two points of a cer- 
tain distance from each other will 
cause activity on one particular hori- 
zontal layer, then if there is more than 
one pair of points at that distance from 
each other, more activity on that one 
particular horizontal layer will result. 
Given a set of points at varying dis- 
tances from each other, activity on suc- 
cessive horizontal layers will show the 
distribution of these distances. By in- 
specting the output of these layers we 
have the information of. how many 
points there are at the distance from 
each other, at the distance +1, and so 
on. We thus have a system perform- 
ing the type of transformation pastu 

1 above and which also conforms 
e the specifications (except Num- 

i laid down above. It should be 
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clear that the first specification has 
been obeyed. The same output will be 
produced by the horizontal fibers inde- 
pendent of the position of the vertical 
fibers impinging on the horizontal 
layer with their arborizations, given 
that channels equal distances apart 
retinally end on the same horizontal 
layer. Similarly, rotation of the optical 
input will not produce any change un- 
less we postulate special and regular 
distortions of the distribution of the 
Teceptor surface or special collecting 
properties of the horizontal layers. 
Finally, mirror images will produce the 
Same output. The output is simply a 
measure of the distances between the 
points in a figure and these are the 
same in mirror images. 

The specification which has not been 
fulfilled is that relating to the invari- 
ance of the same shape in spite of dif- 
ferent size. This requires some fur- 
ther considerations, We have seen 
that in the optic lobe of the bee that the 
number of fibers relative to the dis- 
tance along the retina decreases at each 
Successive layer as the fibers travel 
from the retina, In this way the out- 
put from the horizontal layers which 
would be sending signals out in suc- 
cession as two lines of fixed length 
were moved farther and farther apart 
would decrease layer by layer. Now 
the successive thinning out of the fibers 
could be such that the firing from these 
layers would not decrease as the lines 
were increasing in length in propor- 
tion to their distance. If we descend 
layer by layer (measuring the number 
of neighbors at progressively larger 
distances) the output generated by 2 
small shape will be the same as the 
last portion of the output generated by 
the same larger Shape. In this way 
the pattern for a small shape will be 
Completely reproduced if we transfer 
the animal to a discrimination to larger 
Shapes, though it will be added to at 
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the beginning, whereas if we transfer 
to smaller shapes, the larger shape to 
which the animal was originally trained 
will have the first part missing. The 
problem of how the same relative mes- 
sage appearing on different horizontal 
layers is then taken and reacted to as 
the same message is still of course un- 
solved. There are various alternatives, 
but it is probably better to leave this 
problem by saying that the third speci- 
fication looks as if it could be fulfilled. 
Shape would be given by the relative 
position of the signals on the horizontal 
layers to each other, size by their abso- 
lute position on these layers. 

A remark which should be made 
about the system as it stands is that 
any coincidences (signaling that two 
contour components are within a cer- 
tain distance) will be repeated at each 
succeeding horizontal layer, though 
they will not be signaled at levels which 
measure smaller distances. There are 
ways of removing this redundant in- 
formation. For instance, we can alter 
our system in such a way that all the 
fibers from the retina are taken through 
all the layers with increasing overlap 
of arborization at each level. We can 
then set the system in such a way that 
only the boundary or end of an arbori- 
zation and the center of another ar- 
borization (that is another fiber) will 
produce a response in the horizontal 
layer when they are both on. Though 
there will not be a reduction of the 
number of fibers traveling down to 
each succeeding level in this case, we 
may still suppose that the area over 
which fibers interact to cause a signal 
in the horizontal layer increases as we 
descend downwards. 


ANOTHER MECHANISM 


The mechanism which was derived 
from the neurohistological picture of 
the bee's optic lobes is not the only 
kind of system which would produce 


the postulated transformation and sat- 
isfy the stated specifications. The sug- 
gestion may be worth making as neuro- 
histologically distinct variants of the 
same principle could have developed. 

This second mechanism would also 
generate the same signals, given the 
same shape, irrespective of its locus on 
the receptor surface and its tilt on the 
frontal parallel plane. Further, it will 
confuse mirror images in any orienta- 
tion (as a young child will). Also, 
this mechanism will reduce shapes of 
different size to a similar message in 
the same way as the mechanism de- 
seribed above. That is, the message 
for a small shape will be the same as 
one end of the message signaling the 
same but larger shape. This would be 
consistent with the well known asym- 
metry of transfer between smaller and 
larger shapes. Tt is easier to obtain 
transfer to a larger shape from a 
smaller than vice versa. When trans- 
ferring to a larger shape the animal will 
obtain the complete message it has 
learned together with an additional 
portion. However, when transferring 
to a smaller shape, some of the message 
representing the shape it has learned 
will be missing. We might therefore 
reasonably assume that transfer would 
be worse in this second condition. 

In the mechanism to be considered, 
let us assume that there is a three-di- 
mensional space filled with cells. Each 
cell on the top of this solid sends down 
connections into the solid. Each cell 
or point is conceived as radiating a 
large number of connections in all di- 
rections, but still at a constant angle. 
Each point at the top may thus be con- 
ceived as the apex of a cone, and the 
connections as lines on the surface of 
this cone converging at the apex of the 
top of the solid. (We can, for con- 
venience, think of these as lines regu- 
larly spaced on the surface of the 
cone.) When a contour is projected on 
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the top of this multicellular solid, each 
cell at the base will send down excita- 
tion into its set of connections fanning- 
out downwards as on the surface of a 
cone. If we picture such ascending ex- 
citation as black lines on the surface 
of each cone, these lines on two differ- 
ent cones will intersect at a height de- 
pending on the distance of the apices of 
these cylinders. To place a limitation 
on the number of intersections of lines 
between the two cylinders to two, we 
shall arbitrarily assume that only head- 
on collisions between lines of excitation 
will be registered as coincidences. 
'That is, only lines at a 180-degree an- 
gle to each other will be counted as 
intersecting or, in other words, lines 
intersecting at the tangent of the cir- 
cumference of the two cones. 


this not to be counted at a higher level 
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only this to be counted as coincidence 


In this way the distance of two 
points at the top of the solid will de- 
termine the height at which the excita- 
tions emanating from each will coin- 
cide. The number of coincidences at 
a particular height or horizontal layer 
will be determined by the number of 
points (or rather cells) at a particular 
distance from each other on the con- 
tour of a particular shape. Thus the 
arrangement outlined above will pro- 
duce a histogram or a spectrum for each 
particular shape, with distances along 
one ordinate and the number of points 
at each distance from each other on the 
other ordinate. The spectrum or re 
gram produced will be the same inde- 
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pendent of the position of the shape at 
the base of the solid and also independ- 
ent of its orientation. Mirror images 
will also give rise to the same spec- 
trum. The property of generalization 
to the same shape of different sizes is 
yet to be accounted for. Here it is 
possible to have recourse to the same 
kind of principle as in the first type of 
mechanism outlined above. We need 
merely suppose that the number of pos- 
sible coincidences decreases by the fac- 
tor of the distance downwards. For 
instance, there may be a certain critical 
distance between two ascending lines 
as they pass near each other within 
which a coincidence signal will be gen- 
erated if both lines are excited. As the 
lines pass downwards on the cone they 
will be progressively more widely 
spaced on the surface of the cone and 
therefore increasingly less likely to be 
within the critical distance of each 
other, even if their angle with regard 
to each other is sufficiently correct. If. 
furthermore, the coincidences are 
counted by layers progressively further 
apart (in terms of retinal distance) the 
histogram generated by a small shape 
will be the same as the end of the time 
series of the same large shape. 

The advantage of this second scheme 
is that it could be made to yield good 
discriminations between various de- 
grees of curvature. This would occur 
if we assume that once coincidence had 
occurred between two paths of descend- 
ing excitation, the excitation would 
not descend any further. In this way 
the excitation generated by a straight 
line would all coincide on the first layer 
and there would be no further coinci- 
dences whereas an increase in curva- 
ture would bring about further coin- 
cidences (Figure 3). 


Discrimination of Angle 


So far two systems have been de- 
scribed which yield discrimination © 
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distance between points. Points are 
brought into relation at some third 
point and this third point varies along 
a single continuum depending on the 
distance between the two points. Fur- 
ther, all the pairs of points with the 
same distance between them are rep- 
resented by activity in the same struc- 
ture. This structure is conceived of 
as a layer, so that a particular distance 
between any two points is translated 
into activity in a particular layer. It 
was shown that shape recognition could 
be based on such a system. However, 
as was stated above, discrimination of 
size of angle between two contours is 
also a capacity of the visual nervous 
system. It will be shown briefly below 
how the type of mechanism already 
outlined can simply be expanded to 
provide this capacity. 

It will be apparent that the rate at 
which two contours converge or di- 
verge will be reflected in the angle at 
which successive coincidences between 
points occur in the horizontal layers. 
For instance, two parallel lines will 
cause a set of coincidences which is 
horizontal to the successive layers. An 
increase in angle between lines will 
cause a gradual tilting away from the 
horizontal of the successive points of 
coincidence. Such information could be 
read out from the system in different 
ways. One such possibility would be 
to have cells with fields of arboriza- 
tions shaped like discs or flattened 
spheres sending a signal out whenever 
two coincidences occurred within the 
field. With considerable overlap of 
these fields, angle would be measured 
by the number of cells sending a signal 
out. If this information was sent out 
layer by layer, a mechanism analogous 
to the one based on distance could 
Operate on the basis of angle. Such 
functioning of cells and their arboriza- 
tions is not implausible in view of the 
work of Maturana, Lettvin, McCul- 
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Fic. 3. Histograms produced by second 
mechanism. 
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loch, and Pitts (1960) on the frog's 
visual system. 


Discussion 


The question of uniqueness of trans- 
formation of a given shape has not 
been so far dealt with, No mathe- 
matical proof is so far forthcoming that 
shapes or patterns other than left- 
right or up-down mirror images are 
uniquely transformed. However, ex- 
tensive tests of a trial-and-error nature 
have failed to reveal different shapes 
with identical profiles, though shapes 
that appear similar give rise to profiles 
with certain common features. 

The “preset” systems described 
above give rise to certain determinate 
principles of classification, and so of 
similarity. Such principles seem to 
operate in the case of our own shape 
recognition and therefore they were 
set up as specifications for a theoreti- 
cal system. The similarities were 
therefore built into the system, in con- 
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trast with such attempts as those of 
Uttley (1956) and Taylor (1956). 
Here equivalences must be learned. 
Unless the machine has some already 
built in classification systems to enable 
it to abstract the principles on which it 
is being made to equate diverse groups 
of dots, its economy is bound to be low 
as compared with a preset system, 
which already abstracts the same prop- 
erties as the human operator. Its per- 
formance in novel situations is also 
likely to be poor. Sutherland (1959) 
has made a strong case for supposing 
that there is a preset classifying mech- 
anism. If there is, it is important 
from the point of view of mechanical 
shape recognition to find out what it 
is. The ideal machine would classify 
shapes in the way we do and so would 
see similarity in distorted human out- 
put, such as handwriting, where dis- 
tortion is permitted because it seems 


that the essential elements of identity 
are not lost. 
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PATTERNS OF ADRENERGIC-CHOLINERGIC IMBALANCE 
IN THE FUNCTIONAL PSYCHOSES 
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During this last decade remarkable 
strides have been made in the treat- 
ment and control of the so-called func- 
tional psychoses with the aid of chemi- 
cal agents. In view of the current 
status of the science of chemotherapy 
in this area, it is surprising that such 
beneficent results have been reported 
because there is no widely accepted 
chemical theory of the functional psy- 
choses. A therapist is unable to pre- 
Scribe, on the basis of preliminary 
laboratory examinations (of bodily 
fluids, for example), the chemical agent 
that is relevant to any specific endoge- 
nous chemical disequilibrium. One 
empirical chemotherapeutic approach 
consists of administering drugs on the 
basis of phenomenological impressions 
of simple behavioral dimensions such 
as depression or excitation. It need 
not be emphasized that few empirical 
findings and no theoretical formulations 
extant specifically relate these behav- 
ioral dimensions to any disturbed tissue 
system or state of chemical disequilib- 
rium that is reversible by chemical in- 
tervention. In another approach, the 
tacit assumption is made by therapists 
administering drugs that the popula- 
tion of functionally psychotic indi- 
viduals is homogeneous. In both ap- 
proaches, when improvement does 
occur in some fraction of the popula- 
tion, it had not been predicted. 

Prior to the formulation of several 
theories that have served to stimulate 
a great deal of recent research, numer- 
ous empirical biochemical and physio- 
logical studies were performed in the 
hope of uncovering some characteristic 


endogenous feature that differentiated 
normals from psychotics. The follow- 
ing observations are representative of 
a larger number of studies that suggest 
that the functional psychoses have as 
a concomitant some disorder of homeo- 
stasis. Fischer (1953) reported that 
schizophrenic blood was toxic to tad- 
poles, while Federoff and Hoffer 
(1956) reported it toxic to fibroblasts 
grown in tissue culture. The latter 
researchers, however, found the blood 
of patients undergoing surgery equally 
toxic. This suggested that a nonspe- 
cific perseverative response to stressful 
stimuli may be involved. In addition, 
various measurements of the response 
to stress have been made employing the 
following variables which reflect auto- 
nomic activity: pulse rate, blood pres- 
sure, skin circulation, and response to 
adrenalin (Finkelman & Stephens, 
1936; Pfister, 1938). The results of 
these studies on chronic schizophrenics 
generally indicate that they are less 
reactive than healthy people. When a 
large series of measurements was made 
of basal metabolic rate, circulation rate, 
body temperature, excretion of 17- 
ketosteroids, thyroid activity, and water 
metabolism in schizophrenics and in 
healthy people, it was found that the 
variance of the measurements was 
greater for the schizophrenics (Richter, 
1957). These studies again suggest 
that a concomitant of schizophrenia is 
some disorder of homeostasis. 

Devoid of specific deductively elabo- 
rated hypotheses, this plethora of em- 
pirical findings contributed by physi- 
ologists, pharmacologists, and behav- 
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ioral scientists stimulated the need for 
a rational approach to chemotherapy. 
Several astute attempts at Systematiza- 
tion have been made. They will be 
described briefly because they serve to 
lay the groundwork for the major the- 
sis contained in this report; that is, 
that discrete patterns of autonomic dys- 
function may be etiological factors in 
the functional psychoses. 

The major contemporary chemical 
theories of the functional psychoses are 
inextricably dependent upon the behay- 
ioral and biochemical effects of two 
psychotomimetic agents, mescaline and 
lysergic acid diethylamide (LSD-25) 
(Rubin, 1957). A biochemical theory 
of schizophrenia put forward by Os- 
mond and Smythies (1952) and stated 
more specifically by Hoffer, Osmond, 
and Smythies (1954) postulated the 
existence of an endogenous mescaline- 
like substance, the “M substance,” a 
toxic metabolite formed by the defective 
metabolism of epinephrine. They 
claimed that, whereas normal indi- 
viduals metabolize epinephrine by 
amine oxidase or sulfoesterase, the 
psychotic metabolizes epinephrine by 
a phenolase. The resulting production 
of quinone indoles, i.e., adrenochrome 
and adrenolutine, interferes with cere- 
bral metabolism. Hoffer (1957) ad- 
ministered adrenolutine (a more stable 
isomer of adrenochrome) and a pla- 
cebo to a group of subjects and he 
attempted to determine which persons 
had been given adrenolutine and which 
the placebo. The results of the experi- 
ment were equivocal. Gastaldi, as re- 
ported ina review by Smythies (1958), 
evaluated the effect of the more highly 
reactive, unstable adrenochrome and 
reported the absence of behavioral ab- 
normalities. In a recent study (Hol- 
land, Cohen, Goldenberg, Sha, & 

Leifer, 1958), equilibrium plasma a 
centrations of adrenaline and norad- 


naline were measured in healthy 
re 
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people and schizophrenics. No signifi- 
cant difference was found in the rate 
of utilization of circulating adrenaline 
and noradrenaline. Axelrod (1958) 
attempted to detect the presence of 
adrenochrome in schizophrenics and 
found that it was absent in schizo- 
phrenics as well as in healthy people. 
As yet, this chemical theory oí schizo- 
phrenia that postulates the existence 
of an endogenous metabolite of adrena- 
line produced by a deficiency in the 
process of methylation lacks empirical 
verification. 

The above biochemical theory of 
schizophrenia was established following 
the observation that mescaline, an hal- 
lucinogen, was similar in chemical 
Structure to the neurohumors, adrena- 
line and noradrenaline. The second 
theory to be considered was formulated 
after the observation was made that the 
most powerful hallucinogen, LSD, was 
a potent peripheral antagonist of sero- 
tonin (5-hydroxytryptamine). The 
clue to the important role of serotonin 
came from its discovery in the brain. 
Like acetylcholine, the substance is 
present in nervous tissue in a precursor 
state and is active only in an unbound 
form. The amine is unevenly distrib- 
uted in the brain, its concentration be- 
ing highest in the brainstem, especially 
the hypothalamus, lowest in the cortex, 
and almost undetectable in the cere- 
bellum. The high biological activity of 
this amine strongly suggests an impor- 
tant function in chemical mediation in 
subcortical centers. Additional sup- 
port for the role of serotonin in neural 
transmission comes from consideration 
of the distribution of the enzyme mono- 
amine oxidase, the enzyme that de- 
stroys serotonin, and 5-hydroxytrypto- 
Phan decarboxylase, the enzyme that 
synthesizes serotonin, Both enzymes 
are found in highest concentration in 
the hypothalamus. These considera- 
tions led Brodie and Shore (1957) to 
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hypothesize that serotonin was an im- 
portant chemical mediator of neural 
activity in the brain. Although the 
hypothesis has been fruitful, no current 
evidence has established beyond doubt 
that serotonin plays a transmitter role 
or is auxiliary to the action of other 
transmitters. Woolley and Shaw 
(1954) accepted the presumptive evi- 
dence for serotonin as a central neural 
transmitter substance and went on to 
postulate that the hallucinogenic effect 
of LSD might be due to its interference 
with the action of serotonin in the 
brain. According to these authors, the 
action of LSD, as well as other alka- 
loids, can be ascribed to the indole 
moiety in the structures of LSD and 
serotonin. Consequently, compounds 
containing the indole ring may act as 
antimetabolites to serotonin. This sug- 
gested to them that mental aberrations 
were produced by endogenously pro- 
duced indole compounds that competed 
for serotonin receptor sites in the brain. 
This attractive theory has been chal- 
lenged by at least two recent observa- 
tions. Cerletti and Rothlin (1955) 
found that a close analogue of LSD, 
2-bromo-D-lysegic acid diethylamide, 
was as effective as LSD in antagoniz- 
ing several peripheral actions of sero- 
tonin, yet this antagonist of serotonin 
was found to be devoid of hallucino- 
genic action in humans, It should be 
noted that chlorpromazine is also a 
potent antiserotonin and, instead of be- 
ing hallucinogenic, is a powerful anti- 
hallucinogen employed in the treatment 
of psychotics. 

As indicated by the foregoing review, 
considerable effort is being expended 
to understand the biochemistry of both 
the LSD and the mescaline psychotic- 
like states in normal individuals. AL 
idence provided by re- 


though the ev m 
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searchers who have worked 
these conceptual frameworks has been 
inconclusive, the quest for the specific 
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toxic endogenous metabolite goes on. 
However, we still quite realistically 
speak of the chemical control of schizo- 
phrenia and not the chemical treatment. 
A more detailed critical review of 
these theories led the author (Rubin, 
1959) to conclude that it was far from 
ineluctable that they would ultimately 
lead to an effective chemotherapy for 
the functional psychoses. At this point, 
the author would introduce his theory 
by stating that the functional psychoses 
are characterized by patterns of dis- 
ordered neurohumoral activity in the 
central representation of the autonomic 
nervous system. Since adjustments of 
the internal milieu are mediated by the 
autonomic nervous system and inte- 
grated in the hypothalamus, it is neces- 
sary to consider the humoral nature of 
hypothalamic activity before the theory 
is elaborated and supporting evidence 
is presented. Specifically, the assump- 
tion will be examined that the central 
representations of the autonomic nerv- 
ous system are cholinergic for the 
parasympathetic division and adrener- 
gic for the sympathetic division. 
Vogt (1954) examined the concen- 
tration of sympathin (a combination of 
norepinephrine and epinephrine) in the 
different parts of the central nervous 
system of the dog and found that sym- 
pathin, like acetylcholine, possessed a 
specific pattern of distribution. Two 
regions of the brain, the hypothalamus 
and the area postrema, stood out above 
all others by their high concentrations 
of norepinephrine. This fact suggests, 
though it does not prove, that these 
amines play a part in the specialized 
function of those regions of the brain 
in which their concentration is high. 
Vogt’s conclusion regarding the corre- 
lation between sympathin content and 
central representation of sympathetic 
functions was summarized in the fol- 


lowing statement : 


504 


With exception of the area postrema, 
those regions which contain the highest con- 
centration of sympathin—the hypothalamus, 
parts of the midbrain and the floor of the 
fourth ventricle—also contain the central 
representation of sympathetic activity (p. 
473). 

However, when Vogt attempted to 
localize the site of sympathin within the 
hypothalamus, it was found that the 
anterior and posterior hypothalamus 
did not show consistent differences in 
sympathin content. This is of some 
interest because of the controversy over 
whether or not the anterior hypothala- 
mus is the site of parasympathetic, and 
the posterior hypothalamus is the site 
of sympathetic centers. In this par- 
ticular histochemical aspect, the two 
regions resemble each other. When 
the hypothalamus was divided into dor- 
sal and ventral parts, however, a defi- 
nite concentration gradient was found. 
The ventral region contained more nor- 
adrenaline than the dorsal region. 
Based upon Vogt’s findings, the posi- 
tion taken in this paper is that the 
central representation of Sympathetic 
function in the hypothalamus contains 
the highest concentration of the bio- 
genic amines ; norepinephrine, epineph- 
rine, and serotonin. 

In attempting to ascertain whether 
the central representation of the auto- 
nomic nervous system may be cholin- 
ergic for the parasympatheic division, 
the following facts must be considered. 
For purposes of description, the hypo- 
thalamus in man may conveniently be 
subdivided into four regions: the su- 
praoptic middle region located above 
the optic chiasm and rostral to it, the 
tuberal or infundibular middle region, 
the mammilary middle region, and the 
lateral region. For a purpose that will 
become clearer as the discussion con- 
tinues, the composition of the e. 
optic middle region will be kou s j 
]t includes the nuclei supraopticus, 


: E rachiasmaticus 
paraventricularis, suprachia , 
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supraopticus diffusus, and the anterior 
hypothalamic area (Kuntz, 1953). 
Feldberg and Vogt (1948) attempted 
to map out, in the central nervous sys- 
tem of the dog, the distribution of the 
enzyme (or enzyme system) that syn- 
thesizes acetylcholine. They found a 
very high value of enzymatic activity 
in the region of the supraoptic nuclei. 
By their definition, this portion of the 
hypothalamic area may be considered 
cholinergic. However, the high figures 
for the supraoptic nucleus do not sug- 
gest that cholinergic fibers originate 
from cells in this region, rather, such 
fibers terminate in synaptic connection 
with these cells, Unfortunately, the 
enzymatic activity of the anterior hypo- 
thalamus, the controversial site of para- 
sympathetic activity, was not deter- 
mined. 

It seems reasonable to conclude from 
the foregoing discussion that specific 
areas within the hypothalamus cannot 
be viewed as exclusively adrenergic or 
cholinergic. The evidence does sug- 
gest that the hypothalamus contains 
large quantities of the biogenic amines 
as well as components of the choliner- 
gic system. Consequently, in the re- 
mainder of this paper, when describing 
the central adrenergic-cholinergic im- 
balance that characterizes the func- 
tional psychoses, I do not necessarily 
implicate specific sympathetic and para- 
Sympathetic central structures as such. 

Since Cannon’s (1932) formulation 
of his Emergency Theory of Emotion, 
it has been accepted generally that, 
whenever an organism is confronted 
with a sudden emergency, the resulting 
massive autonomic response is func- 
tionally adaptive, Specifically, a mas- 
sive and complete mobilization is 
achieved by the sympathetic division 
of the autonomic nervous system with 
the resultant secretion of great quanti- 
ties of epinephrine. As Grenell states 
in Fulton (1949), “The so-called ‘fight, 
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flight or fright reaction’ is a product 
of maximal generalized sympathetic ac- 
tivity.” One major feature of an or- 
ganism’s response to an emergency 
situation, therefore, is a significant in- 
crease in the release of certain sym- 
pathomimetic amines. The ascription 
of a role to the parasympathetic divi- 
sion of the autonomic nervous system 
during the emergency reaction has been 
conspicuously neglected. Its role has 
been confined to the protection, con- 
servation, and restoration of bodily 
resources. The hypothesis here offered 
is that the requisite maximal general- 
ized adrenergic activity ensuing the 
perception of threat is facilitated by a 
decrease in cholinergic outflow. If we 
consider, as Cannon did, the functional 
utility of the increased adrenergic re- 
sponse to stress, then it follows that 
maximal mobilization can be most ef- 
fectively attained under a condition in 
which central cholinergic outflow actu- 
ally decreases below levels that charac- 
terize the organism during nonstressful 
steady states. It is conjectured that, 
were cholinergic outflow to increase or 
remain at the level of nonstressful 
states during the time the organism is 
mobilizing for an emergency, the net 
adrenergic response would be less than 
maximal. As the effectiveness of the 
organism's adjustment to emergency is 
directly related to maximal mobiliza- 
tion of bodily resources, it is hypothe- 
sized that maximal mobilization can 
only result from increased activity of 
central adrenergic centers and reduc- 
tion of central cholinergic outflow. As 
a corollary, it is inferred that the pat- 
terns of autonomic dysfunction charac- 
terizing the functional psychoses con- 
sist of significant departures from the 
characteristic interactions of the ad- 
renergic and cholinergic components of 
the normal individual at rest or under 
Stress. 

It is further postulated that neuro- 


humoral imbalance in the functional 
psychoses may involve either or both 
neurohumoral systems and that patho- 
logical variation in either system is 
not unidirectional. Conceivably, either 
malfunction of the adrenergic mecha- 
nism or the cholinergic or both may 
lead to significant departures from that 
reaction requisite for maximal mobili- 
zation of the organism. A hypothetical 
matrix of varieties of neurohumoral 
imbalance has been constructed by re- 
viewing the efficacy and mode of action 
of certain drugs already employed with 
varying success in the treatment and 
control of the functional psychoses. 
Patterns of cholinergic and adrenergic 
malfunction in the psychoses may thus 
be inferred from pharmacological and 
clinical observations. From the fol- 
lowing evidence, it has been deduced 
that neurohumoral dysfunction may be 
characterized by either an excessive or 
deficient adrenergic response mecha- 
nism. 


A DisoRDERED ADRENERGIC RESPONSE 
MECHANISM IN THE FUNCTIONAL 
PSYCHOSES 


Iproniazid (Marsilid), an amine oxi- 
dase antagonist that inhibits the oxida- 
tive deamination of epinephrine, has 
been used extensively in the treatment 
of mental depression. Jn vivo studies 
on animals with iproniazid indicate that 
it can produce a relatively rapid and 
large (threefold) rise in brain seroto- 
nin (Zeller, Barsky, & Berman, 1955). 
Before proceeding to the statement of 
an important inference derived from 
these observations, it would be well to 
evaluate the controversial role of mono- 
amine oxidase in the metabolism of 
biogenic amines. 

There are at least four potentially 
possible metabolic routes for the en- 
zymatic conversions of norepineph- 
rine, epinephrine, and closely related 
catecholamines : deamination by mono- 
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amine oxidase, oxidation to adreno- 
chrome and related products by cyto- 
chrome oxidase and tyrosine oxidase, 
conjugation to the glucuronide or sul- 
fate, or O-methylation by catechol-O- 
methyl transferase (Koelle, 1959). 
Though the importance of monoamine 
oxidase in the oxidation of catechol- 
amines is controversial, it is well es- 
tablished that dopamine, a precursor 
of norepinephrine and epinephrine, js 
oxidized by monoamine oxidase at a 

reasonably rapid velocity. Further- 

more, according to Bacq (1949), there 

is little possibility that the catechol- 

amines are oxidized to indole deriva- 

tives by cytochrome oxidase im vivo. 

As far as the third metabolic pathway 

listed above is concerned, the results 

of a great deal of work suggest that 

free catcholamines and their imme- 

diate metabolites from other pathways 

are conjugated in considerable pro- 

portions. Finally, LaBrosse, Axelrod, 

and Kety (1958) have suggested that 

catechol-O-methyl transferase is the 

enzyme of major importance in the 

metabolism of circulating norepineph- 

rine and epinephrine. They arrived at 

this conclusion after they observed 

that over 90% of a 1-mg. dose of di— 

epinephrine given to normal human 

subjects by intravenous infusion—was 

recovered as the free or conjugated 

form of the primary product, meta- 

nephrine, or its subsequent oxidation 

product, 3-methoxy-4-hydroxymande- 

lic acid. It is important to note that 

the authors still invoke a role for 
monoamine oxidase. It participates in 
the oxidation of metanephrine to the 
mandelic acid derivative. Pekkarinen, 
Castrin, and Israel (1958) present 
evidence suggesting that amine oxidase 
acts directly upon the sepia, 
This position is supported by the be 
hore (1958) who found increase 

of inr of norepinephrine in the brain 
ri other organs of small animals 
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after treatment with iproniazid. On 
the basis of his studies, he concluded 
that: “norepinephrine, as well as sero- 
tonin, is normally metabolized in vivo 
largely through the action of mono- 
amine oxidase.” He criticized much 
of the experimental evidence which 
suggested that monoamine oxidase is 
unimportant in catecholamine metabo- 
lism by emphasizing the erroneous use 
by others of in vitro tissue prepara- 
tions. When monoamine oxidase was 
blocked in vivo by Marsilid, Shore 
could not find any alternate major 
pathway for the metabolism of norepi- 
nephrine. Evidence for or against 
each metabolic route cannot begin to 
be adequately described in this paper. 
A reasonable conclusion is that while 
monoamine oxidase may not be the 
sole enzyme system involved in the 
degradation of the catecholamines, it 
must be considered a significant part 
of the mechanism, 

To return to the effects of iproni- 
azid on the concentration of biogenic 
amines, it is likely, according to Truitt 
(1958), that this drug, through its 
inhibition of monoamine oxidase in the 
brain, potentiates the action therein of 
endogenous stimulant biogenic amines 
(possibly norepinephrine, epinephrine, 
and serotonin). The side effects (con- 
stipation, difficulty in micturition, hy- 
perreflexia, Sweating, and dryness of 
the mouth) are also compatible with 
a hypothesis of peripheral sympathetic 
stimulation. The implication drawn 
from these observations is that some 
psychotics Who are benefited by ipro- 
niazid are defective in some brain 
catecholamines, or in central adren- 
ergic stimulation. 

When one turns to the treatment 
or control of the Schizophrenic re- 
action, it is found that reserpine and 
chlorpromazine have been extensively 
Used. Hyperemotional responses me- 
diated by sympathetic centers in the 
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hypothalamus are markedly depressed 
by the action of reserpine. Peripher- 
ally, the syndrome produced by reser- 
pine is marked by bradycardia, slight 
lowering of blood pressure, stimula- 
tion of peristalsis, and gastric hyper- 
secretion. Brodie and Shore (1957) 
proposed that reserpine prevented 
serotonin binding and thereby caused 
constant activation of the vegetative, 
parasympathetic centers. In addition, 
reserpine was found to deplete the 
store of biogenic catecholamines in 
the body as determined by the in- 
creased excretion of 5-hydroxy-indole- 
acetic acid, a metabolite of serotonin. 
Evidence has been presented in Truitt 
which favors a role for serotonin as 
a central excitatory agent at least in 
subcortical centers. The hyperactivity 
of serotonin, rather than its deficiency, 
therefore, may play a dominant role 
in mediating uncontrolled emotional 
responses in acute psychoses. Thus, 
the mechanism of reserpine action may 
be by depletion of brain concentration 
of serotonin and other catecholamines 
rather than through the sedative action 
of released serotonin. These observa- 
tions therefore suggest that some func- 
tionally psychotic patients for whom 
reserpine therapy is efficacious may 
be characterized by an excessive 
adrenergic activity in subcortical cen- 
ters. This same conclusion is also 
applicable when chlorpromazine is 
considered, for, in addition to its pe- 
ripheral antihistaminic and antichol- 
inergic effects, it has been shown to 
have an adrenergic blocking action. 

In summary, it is postulated that 
the neurohumoral dysfunction that 
characterizes some psychotics may be 
due to either (a) a neurohumoral de- 
some sympathomimetic 


ficiency of : 
yme disorder 


amine or to some enz 


such as mon x . 
activity in the adrenergic excitatory 


mechanism of the brain; OT, (b) a 


oamine oxidase over- , 
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neurohumoral overabundance of some 
sympathomimetic amine or to some 
enzyme disorder such as monoamine 
oxidase underactivity in the adrenergic 
mechanism of the brain. 


A DISORDERED CHOLINERGIC RESPONSE 
MECHANISM IN THE FUNCTIONAL 
PSYCHOSES 


There are suggestions that the cho- 
linergic component of generalized au- 
tonomic activity is also an important 
parameter of the functional psychoses. 
Forrer (1956) has recommended the 
use of large doses of tertiary atropine 
in the treatment of manic-depressives 
in the manic phase. He claimed that 
the patients in the manic phase of 
manic-depressive psychosis responded 
remarkably well and immediately to 
this chemotherapy but he offered no 
mechanism which could possibly ac- 
count for the effectiveness of this 
drug. As the pharmacological action 
of atropine is that of a cholinergic 
blocking agent, it may be inferred that 
excessive brain acetylcholine or de- 
ficient acetylcholinesterase with mani- 
fest excessive cholinergic outflow is 
important to the understanding of the 
chemical fundament in some psy- 
chotics. A strong theoretical state- 
ment which in part supports this in- 
ferred mechanism as to the efficacy of 
atropine therapy has been offered by 
Hoffer et al. (1954). He has claimed 
that excessive parasympathetic activity, 
with concomitant increases in the con- 
centration of acetylcholine, is an es- 
sential feature of the schizophrenic 
process. In contrast to Forrer, how- 
ever, Hoffer believes that cholinergic 
blocking agents as well as choliner- 
gic compounds should aggravate the 
schizophrenic process and perhaps 
make psychotics worse. 

Paradoxically, Pfeiffer, Jenney, Gal- 
lagher, Smith, Bevan, Killam, Killam, 
and Blackmore (1957) have reported 
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Fic. 1. Patterns of adrenergic-cholinergic 
interaction for normals and psychotics. 


that some schizophrenics respond with 
a lucid interval to stimulation by cho- 
linergic stimulants such as pilocarpine, 
arecoline, and 2-dimethylamine etha- 
nol. The implication of this report is 
that some psychotics are characterized 
by defective cholinergic responsive- 
ness. Recently, Rubin (1958a, 1958b) 
has reported that a functionally psy- 
chotic sample of patients was signifi- 
cantly differentiable from normals by 
virtue of the rate of hydrolysis of 
acetylcholine chloride by erythrocyte 
cholinesterase. All of the patients em- 
ployed in the study manifested a defect 
in rate of hydrolysis. Some subjects 
hydrolyzed the substrate too rapidly 
and others too slowly. The inference 
made, on the basis of the kinetics of 
the enzymatic reaction, was that some 
psychotics may be deficient in cholin- 
ergic activity while others may be 
characterized by excessive cholinergic 


activity. 
PATTERNS OF DISORDERED ADREN- 


ERGIC-CHOLINERGIC STEADY STATES 
IN PSYCHOTIC DISORDERS 


A hurried review of the evidence 
sented seems replete with paradox- 
re acts and theories. At first it 


cal f ffective chemotherapy 


ippears that an € 


for some psychotics should counter- 
act deficient adrenergic activity. Then, 
it seems that more effective drugs 
which decrease adrenergic activity 
should be developed. Additional con- 
flict and confusion is raised by asser- 
tions of efficacy for cholinergic as well 
as anticholinergic drugs. These ap- 
parent contradictions may be resolved 
by regarding the population of func- 
tional psychotics as consisting of a 
specifiable number of discrete patterns 
of disordered adrenervic-cholinergic 
levels of reactivity. Vj ache preceding 
review of the pharni’ 7, litical action of 
certain drugs on th “or "rohumoral 
components of the autonomic nervous 
System, it is assumed that in the popu- 
lation of psychotics there may exist 
three levels of adrenergic activity: 
Overreactivity, deficiency, and a nor- 
mal level from which directional de- 
viations are measurable. The cholin- 
ergic outflow may also be described 
as consisting of three levels: over- 
reactivity or deficiency of cholinergic 
levels as measured by reference to a 
level characteristic of the unstressed 
normal The matrix of adrenergic- 
cholinergic steady states derivable from 
Such a paradigm is presented in Figure 
l. In this figure, A represents central 
adrenergic activity and C central cho- 
linergic activity. “The levels of activity 
of each dimension are symbolized by 
+, 0 and —, The positive sign rep- 
resents excessive activity, the zero, 
normal activity, and the negative sign, 
deficient activity. The combination of 
AoCo represents the levels of adren- 
ergic and cholinergic activity that 
characterize the normal human adult 
at rest. The eight other combinations 
represent hypothetical adrenergic-cho- 
inergic levels of interaction. Pre- 
viously, the hypothesis had been de- 
veloped that a normal, ordered response 
to stress consisted of maximal gen- 
eralized sympathetic activity that was 
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attained when cholinergic stimulation 
was decreased (C—), while adren- 
ergic stimulation continued to increase 
(A+) up to that maximal level which 
characterized the individual. This 
characteristic adjustment is contained 
in Figure 1, and may be represented 
as A--C—. It is now further hypothe- 
sized, within the framework of the 
general theory, that the remaining 
seven interactions between adrenergic- 
cholinergic activity represent the dis- 
crete patterns which comprise the 
population of functional psychotics. 
To recapitulate, the major hypotheses 
elaborated from the above considera- 
tions are: (a) Effective autonomic 
adjustment to emergency situations 
requires a maximal generalized adren- 
ergic response; (b) To attain a maxi- 
mal level of sympathetic activity, it 
is not only necessary that increased 
levels of adrenergic activity follow the 
emergency but that the level of cho- 
linergic activity decrease; (c) The 
functional psychoses are characterized 
by patterns of neurohumoral dysfunc- 
tion that deviate significantly from the 
pattern of the normal, healthy individ- 
ual at rest, as well as from the adren- 
ergic increase and cholinergic decrease 
that characterizes the normal individ- 
ual during stress; and (d) The va- 
rieties of neurohumoral dysfunction 
represent discrete levels of activity of 
interacting adrenergic and cholinergic 
mechanisms. That is, each pattern of 
neurohumoral dysfunction represents 
an impairment of function of either the 
adrenergic or cholinergic mechanism 


or both. 


Puprecary Reactivity AS A METHOD 
FoR INVESTIGATING PATTERNS OF 
DISORDERED ADRENERGIC-CHO- 
LINERGIC STEADY STATES IN 
Psycrotic DISORDERS 


The hypotheses elucidated would 
remain interesting speculations unless 
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a method were described which could 
systematically measure the relevant 
parameters. To this end a physio- 
logical measure of adrenergic and cho- 
linergic activity was developed for in- 
vestigating the patterns of disordered 
autonomic steady states in psychotic 
disorders. The miotic response of the 
pupil to light (constriction) and the 
mydriasis ensuing the absence of light 
(dilatation) were employed to meas- 
ure cholinergic and adrenergic mecha- 
nisms, respectively. The rationale of 
the procedure is suggested by the fol- 
lowing considerations. The choliner- 
gic nature of the activity of the oculo- 
motor nerve under the influence of 
light has been proved by the positive 
assay for acetylcholine in the anterior 
chamber of the eye (Engelhart, 1931) 
and by the disappearance of the light 
reflex after the injection of cholines- 
terase (Mendel & Hawkins, 1943). 
Conversely, the anticholinesterases in- 
duce miosis, presumably by interfering 
with the hydrolysis of acetylcholine by 
acetylcholinesterase. The adrenergic 
nature of the dilatory response of the 
pupil is suggested by the work of Poos 
(1939), Hess and Koella (1950), and 
Hess, Koella, & Szabo (1950) on iso- 
lated radial preparations of the iris. 
These studies demonstrated that the 
iris muscle contracted in response to 
small concentrations of adrenalin and 
noradrenalin. Since these substances 
are responsible for neurohumoral 
transmission in sympathetically inner- 
vated structures, they believed it rea- 
sonable to assume that a dilator muscle 
existed that responded to sympathetic 
excitation. In addition, Frohlich and 
Loewi (1910) had shown that the 
mydriasis induced by adrenalin is in- 
tensified by cocaine, a monoamine 
oxidase inhibitor. 

Since pupillary constriction had been 
shown to be dependent upon the lib- 
eration of acetylcholine, and dilatation 
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upon an adrenergic mediator, the rate 
of dilatation was considered to repre- 
sent the kinetics of a general adrenergic 
mechanism, while the rate of constric- 
tion was considered indicative of the 
kinetics of the cholinergic mechanism. 
If the varieties of disordered neuro- 
humoral function that characterize the 
functional psychoses do represent dis- 
crete levels of activity of interacting 
cholinergic and adrenergic mecha- 
nisms that differ significantly from the 
patterns manifested by normals at rest 
and under stress, then different pat- 
terns of constriction and dilatation 
should be observable. That is, it was 
assumed that peripheral cholinergic 
and adrenergic imbalances, respec- 
tively, mirror central cholinergic and 
adrenergic imbalances. 


PROCEDURE 


Although the apparatus and procedure for 
obtaining measurements of pupillary dilata- 
tion and constriction have been described else- 
where (Rubin, 1960, 1961a, 1961b), it seems 
appropriate to emphasize certain features of 
the procedure for increased clarity in exposi- 
tion. To obtain the rate of dilatation for each 
subject, the following procedure was em- 
ployed. After the subject was seated with his 
head positioned properly in a head rest, the 
room light was extinguished. After 5 seconds 
in the dark, an intense adapting light of 38.6 
ft.-C. was focused on the subject’s right eye 
for 10 seconds. At the termination of this 
interval, a photograph of the maximally 
constricted pupil was taken automatically. 
This was done twice in an effort to equate 
for previous conditions of adaptation of the 
retina. Each subsequent photograph of the 
pupil was taken after the subject had been 
dark-adapted for a prescribed interval of 
time following maximal constriction, The 
intervals of dark-adaptation employed were 
5, 10, 15, 30, 45, and 60 seconds. The pro- 
rea employed to obtain measurements of 
coore i of pupillary constriction was in- 
E soon after measurements of pupillary 
1ta o had been obtained. Each photo- 
QURE f the pupil was taken after the sub- 
ho been light-adapted for a specific 
ject Tap n e iuc ATIS 
is the intervals of light-adaptation em- 


grapl 


ployed were 1, 2, 3; 4, and 5 seconds. Prior 
to obtaining a measurement of constriction 
at each of these intervals, the eye was light- 
adapted for 10 seconds at a constant intens- 
ity before it was permitted to dilate maxi- 
mally within the 60-second dark-adaptation 
period. 


PATTERN oF ADRENERGIC-CHOLINER- 
GIC Activiry OF THE NORMAL 
ADULT UNDER STRESS 


To test the hypothesis that normal 
adult subjects react to emergencies 
with an increased adrenergic response 
and decreased cholinergic response, 
measurements of pupillary dilatation 
and constriction were obtained on six 
normal, healthy adults between the 
ages of 19-40 (Rubin, 1960). Due 
to the variety of perceptual responses 
that are elicited in humans to the 
same stimulus, it is difficult to define 
a stimulus situation that is realizable 
in a laboratory as an “emergency” or 
as a “threat.” Consequently, follow- 
ing the control measurements obtained 
when the subjects were at rest, each 
subject was required to immerse the 
right hand and forearm in a cold water 
bath maintained at 5° C. while meas- 
urements were made of dilatation and 
constriction. When the rate of pupil- 
lary dilatation at rest was compared to 
the rate obtained during the stress 
condition, it was found that the slopes 
of the curves were equal (parallel) 
but that the absolute level of dilatation 
under the cold pressor condition was 
significantly greater (Figure 2). As- 
suming that the responsiveness of the 
dilator pupillae is an increasing mono- 
tonic function of the amount of adren- 
ergic mediator liberated at the effector 
site, it may be concluded that under 
Stress a greater quantity of the adren- 
ergic mediator is liberated. Therefore, 
the finding that the cold pressor test 
increased pupillary dilatation signifi- 
cantly was interpreted as supporting 
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Fic. 2. Pupillary dilatation in normal sub- 
jects exposed to the cold pressor test. 


the hypothesis that under a condition 
of stress, the normal human adult 
responds with an increased adrenergic 
outflow. 

When the rates of pupillary con- 
striction over a period of 5 seconds 
for the normal individuals at rest and 
under the condition of the cold pressor 
were compared, a significant difference 
was obtained (Figure 3). As opposed 
to the relationship observed during 
dilatation, the level of constriction of 
the pupil under the condition of the 
cold pressor test was decreased. This 
suggested a decreased cholinergic out- 
flow in normal humans under stress, 
when it was assumed that the degree 
of constriction of the sphincter pupil- 
lae was an increasing monotonic func- 
tion of the amount of acetylcholine 
liberated at the effector sites. 

The significance of these findings. 
is contingent upon two basic 
t the cold pressor test 
or equivalent to other 
ogical or psychological 
stress, and that the adrenergic and 
cholinergic reactivity of the pupil is 
representative of adrenergic-choliner- 
gic mechanisms of interaction in vital 


however, 
assumptions : tha 
is representative 
forms of physiol 


neural integrative centers. If these 
assumptions are accepted, the follow- 
ing interpretation of the results of the 
experiment may be made with reason- 
able assurance. Upon exposure to a 
stress-engendering situation, the nor- 
mal human organism responds with 
an increased adrenergic outflow and a 
decrement in cholinergic outflow. This 
reciprocal balance between the sys- 
tems insures maximal mobilization of 
the resources of the organism required 
ior "fight, flight, or fright." 


ADRENERGIC-CHOLINERGIC REACTIV- 
Iry LEVELS or FUNCTIONAL 
PsyCHOTICS 


Several additional experiments were 
performed by the author (Rubin, 
1961a, 1961b) in order to test two 
additional hypotheses. The first hy- 
pothesis tested was that the population 
of psychotics exhibited three distinct 
levels of adrenergic activity and three 
distinct levels of cholinergic activity. 
The second hypothesis tested was that 
psychotics manifest patterns of neuro- 
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Fic. 3. Pupillary constriction in normal 
subjects exposed to the cold pressor test. 
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humoral dysfunction that deviate sig- 
nificantly from the pattern of normals 
at rest, as well as from the adrenergic 
increase and cholinergic decrease that 
characterizes the normal person dur- 
ing stress. Furthermore, the varieties 
of neurohumoral dysfunction could be 
represented by discrete patterns of 
levels of adrenergic-cholinergic inter- 
action, as described previously in Fig- 
ure 1, 
To test these hypotheses, measure- 
ments of dilatation and constriction 
were obtained on 23 healthy, control 
subjects between the ages of 19-40 
years. To determine the extent of di- 
latation between the fifth and sixtieth 
seconds, independent of the individual 
differences in the final level of dilata- 
tion attained, the following measure- 
ment was employed. "The initial level 
of dilatation (mm.) was subtracted 
from the final level. (mm.) and the 
difference was divided by the final 
level of dilatation. The ratio obtained 
in this manner for each subject repre- 
sented the increase in diameter of the 
dark adapted pupil as a function of 
the maximal dilatation attained at the 
end of 60 seconds spent in darkness. 
Assuming that the sample of healthy 
subjects employed in the experiments 
is a random sample from a population 
with a normal distribution for the ob- 
tained ratio, then the sample mean, 
as an unbiased estimate of the popula- 
tion mean, may be employed to deter- 
mine the likelihood of obtaining values 
for the ratio that deviate significantly 
in the direction of excessive or defi- 
cient dilatation. A graphic test to 
determine the normality of the dis- 
tribution (Bliss & Calhoun, a of 
the dilatation ratio was perfor med. 
The rankits were plotted against their 
r ling variates and a straight 
esponding A dre x 
E fittel easily bv inspection, 
[hie "Was n - suggested that the sam- 
This strongly m udi scores was dis- 
ple of dilatatio 
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tributed normally. One could now 
make inductive inferences from the 
sample with a reasonable degree of 
assurance. Consequently, the cumula- 
tive distribution function was em- 
ployed to determine the relative fre- 
quency with which observations fell 
beyond the extreme dilatation ratios 
obtained from the sample. 

In order to determine the extent of 
pupillary constriction, measurements 
of the diameter of the 1 minute dark- 
adapted pupil were obtained following 
exposure to a constant intensity of 
light that lasted for 1, 2, 3. 4, or 5 
seconds. A constriction ratio for each 
subject was obtained in the following 
manner. To avoid negative values, 
the final level of constriction (mm.) 
was subtracted from the invariably 
greater initial level (mm.) and then 
divided by the final level. The graphic 
test for normality of the distribution 
of the ratios was performed and it was 
found that they were distributed nor- 
mally. The cumulative distribution 
function was again employed to deter- 
mine the likelihood of obtaining con- 
striction ratios that fell beyond the 
extreme constriction values of the 
sample. 

The probability of obtaining dilata- 
tion or constriction ratios significantly 
greater than or less than the extreme 
values in each distribution was then 
employed to evaluate the measure- 
ments obtained from a sample of psy- 
chotics. 

The results to be described were 
obtained on two samples of patients 
who were tested at two different psy- 
chiatric institutions. As the results 
of the first experiment were confirmed 
by the second experiment, the follow- 
ing description pertains to 71 patients 
who had been diagnosed as function- 
ally psychotic (Rubin, 1961b). Em- 
ploying the probability limits obtained 
for the distribution of the dilatation 
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ratio in the sample of normal individ- 
uals, it was found that 21 of 71 pa- 
tients manifested excessive dilatation, 
26 of 71 patients manifested deficient 
dilatation, and 24 of 71 patients mani- 
fested normal dilatation. These rela- 
tionships are presented in Figure 4 
where the mean dilatation ratio for 
cach of the three groups at each time 
interval is presented as a function of 
the amount of time spent in darkness. 

Inspection of the figure clearly indi- 
cates that there are three levels of 
dilatation that characterize the sample 
of patients employed in the experi- 
ment. The hypothesized three distinct 
levels of adrenergic activity, as meas- 
ured by pupillary reactivity, seem to 
have been confirmed. That is, if it 
is assumed that the magnitude of pu- 
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Fic. 4. Reactivity levels of pupillary di- 
latation in psychotic patients. (The values 
of the ordinate, D,-D,/D,, consist of the 
final level of dilatation, mm., after 60 sec- 
onds of dark adaption, D,, minus the level 
of dilatation at each interval in darkness, 
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of darkness in seconds, t, preceding photo- 
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Fic. 5. Reactivity levels of pupillary 
constriction in psychotic patients. (The 
values on the ordinate, D,-D,/D,, consist 
of the average diameter—mm.—of the con- 
stricted pupil at each interval, Ds, minus 
the diameter of the maximally constricted 
pupil after 5 seconds of light adaptation, Di, 
divided by Dy. The abscissa represents the 
duration of light adaptation in seconds, f.) 


pillary dilatation is proportional to the 
quantity of adrenergic mediator liber- 
ated, then it may be concluded that 
some of the patients liberated a quan- 
tity that was characteristic of normals 
at rest; others liberated more of the 
adrenergic mediator or had some en- 
zyme disorder such as monoamine oxi- 
dase underactivity ; and some liberated 
too little of the adrenergic mediator or 
had some enzyme disorder such as 
overactivity of monoamine oxidase. 
Employing the probability limits ob- 
tained from the distribution of the 
constriction ratio in the sample of nor- 
mal individuals, it was found that 36 
of 71 patients manifested excessive 
constriction, 7 of 71 patients mani- 
fested deficient constriction, and 28 of 
71 patients showed normal constric- 
tion. These relationships are pre- 
sented in Figure 5 where the average 
constriction ratio for each of the three 
groups at each interval is represented 
as a function of length of time of ex- 
posure to à constant intensity light 
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source. When the curve for those 
showing defective constriction is com- 
pared to the curve representing nor- 
mal constriction, it is seen that the 
amount of change in pupil size for the 
deficient group remains smaller than 
for normal constriction. For the group 
manifesting excessive constriction, it 
is apparent that the amount of con- 
striction per unit time is much greater 
than that of the normal or defective 
group. With respect to constriction, 
therefore, three levels, each signifi- 
cantly different from the other, were 
found in the sample of patients em- 
ployed in the studies, Assuming that 
the tendency toward constriction of 
the sphincter pupillae is proportional 
to the quantity of acetylcholine liber- 
ated, the demonstration of three levels 
of constriction suggests that: (a) in 
one group of patients, either excessive 
quantities of acetylcholine were liber- 
ated or there was deficient acetylcho- 
linesterase activity; (b) in a second 
group of patients, insufficient quanti- 
ties of the neurohumor were liberated 
or enzyme activity was excessive; (c) 
the third group manifested that level 
of cholinergic activity that was equiva- 
lent to the level of normal subjects. 
These observations on the kinetics of 
adrenergic and cholinergic activity 
confirmed two hypotheses deduced 
from the theory ; that is, the existence 
of three distinct levels of adrenergic 
activity and three distinct levels of 
cholinergic activity. 

In order to establish that there are 
yarieties of neurohumoral dysfunction 
that comprise the population of func- 
tional psychotics, it remained neces- 
sary to show that the disorders con- 
sisted of an impairment of function 
" either the adrenergic or cholinergic 

-hanism, or both. The demonstra- 
ope qoe patterns of interaction 
Vn derived from the following con- 
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siderations. 
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psychotic patients, it will be recalled, 
showed no impairment in the extent of 
dilatation. However, all of the mem- 
bers in this group manifested aberrant 
constriction. Twenty of the 24 mani- 
fested excessive constriction, while 4 
patients were deficient in constriction. 
Conversely, of the 28 psychotic pa- 
tients out of 71 who showed no im- 
pairment in constriction, all were aber- 
rant in the extent of dilatation; 12 
showed excessive dilatation and 16 
manifested deficient dilatation. In 
confirmation of the major hypothesis, 
it may be stated that all of the 71 pa- 
tients employed in the studies showed 
a significant impairment in either the 
tendency to dilatation or to constric- 
tion or both. Furthermore, if the 
magnitude of dilatation is considered 
to be proportional to the quantity of 
adrenergic mediator liberated, and the 
amount of constriction of the sphincter 
pupillae proportional to the quantity of 
acetylcholine liberated, then it may be 
concluded that all of the psychotics em- 
ployed in these studies were differenti- 
able into seven discrete patterns of 
adrenergic-cholinergic imbalance. 

The varieties of adrengeric-cholin- 
ergic dysfunction elucidated indicate 
that the population of functionally 
psychotic individuals is heterogeneous. 
Thus, the current quests for a chemi- 
cal panacea that affects only one of the 
components of the autonomic nervous 
System undirectionally do not seem 
promising. Instead, the results of 
these studies Suggest that each pattern 
of adrenergic-cholinergic activity may 
require a specific combination of drugs 
Which differentially affect the level of 
activity of either or both components 
(Table 1). The value of the present 
Study is that it seems to provide a ra- 
tionale for the selection of patients 
who require distinct chemotherapies 
as determined by their specific pattern 
of adrenergic-cholinergic imbalance. 
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An obvious but unwarranted im- 
plication that may be drawn by the 
reader of the foregoing assertion is 
that the author believes with warrant- 
able assertability that these patterns 
of autonomic dysfunction are etiologi- 
cal factors in the functional psychoses. 
Admittedly, it is difficult to conceive 
of an environmental process that could 
produce central neurohumoral dys- 
function, However, the experiments 
described in this report were not de- 
signed to answer the question, “Is 
central neurohumoral dysfunction the 
invariable antecedent of psychotic be- 
havior or the effect of dynamic, en- 
vironmental variables?” Although the 
answer to the question may be of 
academic interest, it would be more 
fruitful to accept the attitude of a 
pragmatist and insist that truth is pre- 
eminently to be tested by the practical 
consequences of belief. 


ApnENERGIC-CHOLINERGIC DYSFUNC- 
TION IN PsycHOTIC CHILDREN 


The technique of employing pupil- 
lary measurements to determine adren- 
ergic-cholinergic steady-states that 
characterize adult psychotics was also 
applied to ascertain whether autistic, 
psychotic children were significantly 
differentiable from normal children. 
An additional comparison could read- 
ily be made between those steady-states 
that characterized normal children and 
the adrenergic-cholinergic responsivity 
of normal adults. Thus an opportunity 
was also afforded to contribute to an 
understanding of the ontology of au- 
tonomic reactivity in normal individ- 


uals. 
In the prelimi 
described, data o 


nary experiment to be 
n dilatation and con- 
striction was obtained on four normal, 
healthy children between the ages of 


7-9.5 years, and five autistic children 


between the ages of 7-12 years. A 


TABLE 1 


A CLASSIFICATION OF FUNCTIONAL PSYCHOSES 
INTO DISCRETE LEVELS OF ADRENERGIC- 
CHOLINERGIC INTERACTION AND THE 
DERIVED CHEMOTHERAPIES 


Adrenergic- 
Group Cholinergic Suggested chemotherapy 
pattern 
I | A+C-+ | Adrenolytic agent in ad- 
dition to an anticho- 
linergic compound 
II | AoC+ Anticholinergic com- 
pound 
HI | A—C+ | Adrenergic stimulant in 
addition to an anticho- 
linergic compound 
Iv | A+Co Adrenolytic agent 
v | A—Co | Adrenergic stimulant 
VI | AoC— Cholinergic stimulant 
VIL | A—C— | Adrenergic stimulant in 
addition to cholinergic 
stimulant 


statistical analysis of the rate of dilata- 
tion between these groups indicated 
that the slopes were equal and parallel 
but separated significantly by a con- 
stant (Figure 6). Again, assuming 
that the magnitude of the response of 
the dilator pupillae is an increasing 
monotonic function of the amount of 
adrenergic mediator liberated at the 
effector site, then the absolute level of 
adrenergic activity for the normal chil- 
dren is greater than that for the psy- 
chotic children. This suggests that 
normal children have a significantly 
greater capacity for increased adren- 
ergic outflow than autistic children. 
When the two groups of children were 
compared with respect to the rate of 
constriction, it was found that the 
rate was significantly faster for the 
normal children, although no signifi- 
cant differences obtained in the final, 
absolute level of constriction (Figure 
7): This result was interpreted as 
indicating that the rate of cholinergic 
activity was more rapid for normal 
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children. These results suggest that 
normal children have a greater capac- 
ity for sympathicoadrenal response, 
and, of equal importance. a greater 
manifest capacity to reestablish more 
quickly the vegetative functions me- 
diated by the cholinergic system. 
Clearly, autistic children demonstrate 
a pattern of autonomic dysfunction 
when compared to normal children, 
However, the limited sample sizes em- 
ployed in this preliminary experiment 
precluded the ultilization of the statis- 
tical techniques employed to establish 
discrete patterns of neurohumoral dys- 
function in adult psychotics, 
Reference to Figure 6 reveals an- 
other experimental comparison that 
may throw some light upon the on- 
tology of adrenergic responsivity. 
When the course of dilatation for nor- 
mal children was compared to that of 
normal adults, it was found that the 
mean pupil diameter at each interval 
for the children was significantly 
greater than that of the normal adults. 
Again, if it is assumed that the amount 
of dilatation is proportional to the 
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as a function of duration of light adaptation 
for normal adults, normal children, and au- 
tistic, psychotic children, 


amount of adrenergic mediator liber- 
ated, then it may be concluded that 
normal children have a significantly 
Sreater capacity for increased adren- 
ergic outflow. An interesting differ- 
ence in cholinergic reactivity was re- 
vealed when the course of constriction 
in normal adults was compared to that 
of normal children, A statistical analy- 
sis of the results presented in Figure 
7 revealed that normal adults mani- 
fested significantly greater constriction 
than normal children, However, al- 
though the absolute level of cholinergic 
activity was found to he consistently 
greater for the normal adult, the rate of 
cholinergic activity was found to be sig- 
nificantly greater for the normal child. 
These results suggest that normal chil- 
dren and normal adults are significantly 
different with respect to autonomic 
Tesponsivity. Furthermore, although 
the intermediate biochemical (ge- 
netic?) factors responsible for the 
change in neurohumoral activity are 
not as yet known, it would seem that 
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the normal child, as he develops into 
a normal adult, undergoes changes that 
reduce the level of adrenergic outflow, 
decrease the rate of cholinergic activity, 
and increase the absolute level of 
cholinergic outflow. 


SUMMARY 


The results of the studies described 
in this report confirmed several hy- 
potheses derived from a theoretical 
. formulation of the role of adrenergic 
and cholinergic mechanisms in psy- 
chotic disorders. The salient features 
of the theory are: effective adjust- 
ments to emergency situations require 
a maximal, generalized sympathetic 
response; to attain a maximal level of 
sympathetic activity, it is not only nec- 
essary that increased levels of adren- 
ergic outflow follow the emergency but 
that the level of cholinergic activity 
decrease; the neurohumoral dysfunc- 
tions that characterize the psychoses 
represent significant departures from 
the adrenergic-cholinergic states that 
characterize the normal individual at 
rest and during stress; and the varie- 
ties of dysfunction represent discrete 
levels of activity of interacting adren- 
ergic and cholinergic | mechanism. 
That is, cach pattern of neurohumoral 
imbalance represents an impairment of 
function of either the adrenergic or 
cholinergic mechanism or both. 

The nature of the adrenergic impair- 
ment that characterizes some psychotics 
may be a neurohumoral deficiency of 
some biogenic amine or some enzyme 
disorder such as monoamine oxidase 
overactivity in the adrenergic centers 
of the brain. Other psychotics mani- 
fest adrenergic impairment by a neuro- 
humoral excess of some sympatho- 
mimetic amine that may be the result 
of some enzyme disorder such as 
monoamine oxidase underactivity in 
central adrenergic centers. Similarly, 
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the cholinergic impairment of some 
psychotics may be characterized by ex- 
cessive concentrations of brain acetyl- 
choline or deficient acetylcholinester- 
ase, while other psychotics may possess 
an inadequate level of acetylcholine or 
excessive acetylcholinesterase activity. 
A theoretical matrix of neurohumoral 
steady states, derived from considera- 
tions of the interaction of the levels 
of the two systems, characterized the 
population of functional psychotics as 
well as the normal individual at rest 
and under stress. ' 

The application of measurements of 
pupillary reactivity to the study of 
autonomic dysfunction in autistic, psy- 
chotic children demonstrated that ad- 
renergic-cholinergic imbalance clearly 
differentiated these children from nor- 
mal children. The significance of this 
obtained difference was evaluated with 
respect to the pattern of autonomic 
function manifested by normal adults. 

That the adult psychotic may be 
classified into any one of seven discrete 
categories of adrenergic-cholinergic 
activity levels throws reasonable doubt 
upon the current quest for a single 
chemical panacea. Instead, the eluci- 
dation of these discrete patterns of 
adrenergic-cholinergic activity suggests 
a rational approach to chemotherapy. 
Each pattern of adrenergic-cholinergic 
dysfunction requires a specific combi- 
nation of drugs that differentially affect 
the levels of activity of either or both 
components of the autonomic nervous 
system. 
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'The problem solving process, which 
begins with the presentation of a 
problem, is terminated when the sub- 
ject arrives at a correct solution, when 
he arrives at an incorrect solution he 
believes correct, or when the experi- 
menter arbitrarily ends the experi- 
mental session. Performance on a 
problem is usually summarized by the 
proportion of subjects who solve, or 
by the mean or median time to 
solution. A more complete descrip- 
tion of the data gives the success of 
each subject (whether he obtained the 
correct answer or not) and the time 
taken; that is, a distribution of 
solution times or times until terminal 
error. One purpose of this paper is to 
show how the full distributions can be 
used to answer theoretical questions 
about the structure of the problem. 

Small ad hoc groups of subjects can 
work together on verbal puzzles. 
Such cooperation seems natural and 
is appropriate to the nature of the 
problem and the usual behavior of 
college students. Generally Speaking, 
small groups solve problems more 
rapidly and surely than individuals 
from the same population (Faust, 
1959; Husband, 1940; Lorge, Fox, 
Davitz, & Brenner, 1958; Lorge, 


1 Developed from part of a doctoral 
dissertation submitted to Michigan State 
University by James H. Davis (1961). The 
irs thor directed the dissertation problem. 
ce Research Fellow of the Social 


Science Research Council. Now at Indiana 
Scie s 


ur part of the work of this problem, 
s d Fellow of the National Science 
= Leone eed Now at Miami University 
Found: : 


(Ohio). 


& Solomon, 1955; Marquart, 1955; 
Shaw, 1932; Taylor & Faust, 1952; 


Thorndike, 1938; and Watson, 1928). 
The advantage of groups may be a 
simple effect of pooling, "two heads 
are better than one," for in coopera- 
tive problem Solving the group gets 
credit for the first individual insight. 
Other factors such as interference be- 
tween group members or emergent 
group facilitation of thinking must be 
measured as deviations from the 
simple “pooling” hypothesis. The 
second aim of this Paper is to formu- 
late the pooling hypothesis so that it 
applies to distributions of solution 
times, 


MODEL or typ DISTRIBUTION 
OF SOLUTION Times 


Suppose that time is divided into 
short equal intervals of duration A. 
In any interval the subject may either 
solve the problem or not. Imagine 
that there exists a probability e that 
the problem is solved in the nth 
interval given that it has not been 
Solved previously. 'The probability ¢ 
is taken by hypothesis to be constant 
Over intervals, Then the probability 
that the problem is actually solved in 
exactly the nth interval is the prob- 
ability that it is not solved in any of 
the first » — 1 intervals (an event 
with probability (1 — $)"-) times 
the probability that it is solved in the 
nth interval, That is, 


P(n) = e(1 — gyn 


the geometric distribution. 
Positively skewed (J-sh 


[1] 


This is a 
aped) distribu- 
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tion with mean 1/¢ and variance 
(t — 29)/e. 

Tosay that eis constant over trials, 
that the probability of solving is in- 
dependent of how many times the 
subject may have failed to solve the 
problem, is in effect to say that 
problem solving is a one-stage, all-or- 
nothing process. This assertion is 
frequently unreasonable in that some 
problems clearly seem to involve more 


than one stage. 


Multi-stage Problems 


Suppose that to obtain the final 
solution the subject must actually 
solve a sequence of stages. The stages 
may be solved in fixed order, or it may 
be that the order of stages can be in- 
verted, but suppose that the subject 
actually can work on only one stage 
atatime. Furthermore, suppose that 
the time to solve an individual stage 
has the geometric distribution, and 
that the distribution of times to solve 
Stage 2 is independent of how long it 
took to solve Stage 1, etc. (Of course, 
it is assumed that work on Stage 2 
does not even start until Stage 1 is 
completed.) The distribution of times 
to solution for a & stage problem can 
be determined ; it isa random variable, 
and the sum of the & random variables 
which are the times to complete the 
k stages. If the k stages have the 
same probability of solution per in- 
terval, e, then the probability that 
the kth stage (and the whole problem) 
are completed at just trial JV, is given 
by the negative binomial (Pascal) 


distribution, 


P(N;k, e) 


An intuitive justification of this for- 
mula is given by Bush and Mosteller 


(1955), Ch. 14) and its derivation is 
discussed by Feller (1957, pp. 155- 
157). Note that Equation 2 reduces 
to Equation 1 when & — 1. The ex- 
pected number of time intervals to 
solution, the expectation E(N), and 
the variance of number of intervals, 
are given by 


E(N) = k/¢ [3] 
Var(N) = k(1— e)/e — [4] 


Although it is not likely that the 
various stages of a given problem 
would be equally difficult (have equal 
values of ¢), there is great difficulty 
in estimating g's of separate stages 
unless one has direct observations of 
the completion of various stages. 
Such analyses can sometimes be done 
but it is helpful to be able to deal 
directly with total solution time. For 
this purpose it is expedient to use the 
simplifying hypothesis that all stages 
are equally difficult, so as to be able 
to estimate the number of stages in 
the problem. 


Continuous Time 


The above remarks deal with short 
intervals of time, yet time to solve a 
problem may be measured continu- 
ously. If the interval % is taken 
smaller and smaller, then the prob- 
ability of solution within interval h, 
(h), also decreases but the ratio 
V (h)/h will (in any application of the 
present theory) approach a limit. Let 


À = lim (s&) 
hoo h 


and it can be shown by standard 
methods that for a one-stage process, 
the density function describing the 
distribution of problem solving times 
is 


SEN = MM [5] 


the exponential distribution; and in 
the case of k equally difficult stages, 


A = 
gli; A, k) = 9-134" o» * 6] 


the gamma distribution. The mean 
and variance of the gamma distribu- 
E 

tion are 


E(t) =k/A = u 
Var (t) = k/X = gt 


[7] 
[5] 
The exponential and gamma distribu- 
tions are conceptually just like the 
geometric and Pascal distributions 
given above; in fact they are limits 


taken by going to continuous time 
(intervals of duration zero). 


Estimates of Parameters 


A gamma distribution with pa- 
rameter k (k stages) is the distribution 
of the sum of k exponentially distrib- 
uted, independent, random variables. 
An exponentially distributed variable 
has mean equal to standard deviation, 
u = 6 = 1/), from Equation 7 and 8. 
When one adds independent random 
variables, the mean of the sum is the 
sum of the means, and the variance of 
the sum is the sum of the variances, 
With more stages, accidental fluctua- 
tions tend to cancel out somewhat, so 
that the standard deviation becomes 
smaller relative to the mean, the more 
stages are involved. With increasing 
k, the gamma distribution becomes 
more nearly symmetrical, and nar- 
rower. This property can be used to 
estimate the parameter k. Note that 
from Equations 7 and 8, 


HW = X 
and E 
a = xg 
so that w/o? = k [9] 
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Given large samples of observations, 
it is possible to substitute the obtained 
mean and variance, Í and s for the 
parameters, obtaining the estimation 
equation 


k= (Ds? [10] 


The authors have used this esti- 
mate, an example of the method of 
moments, instead of possible maxi- 
mum likelihood or minimum chi 
Square estimates. The method-of- 
moments estimate seems relatively 
insensitive to imperfections in the 
experimental procedure, and its gen- 
eral significance remains sensible even 
if the actual stages of the problem are 
not exactly equal in difficulty. If the 
stages differ in difficulty, then & can 
be considered as follows; the actual 
problem is represented by a model 
problem with equally difficult and 
independent Stages, and k is the num- 
ber of stages in the model. The inter- 
pretation is like one common in in- 
formation theory ; if a certain system 
transmits just 3 bits, it can be repre- 
Sented by a system which perfectly 
discriminates exactly 8 equally prob- 
able signals. The value of 3 bits may 
actually be derived from a system 
Which makes imperfect discrimina- 
tions of 24 categories which are not 
equally probable, but the model gives 
a general impression of the behavior 
of the system. Similarly, our use of a 
gamma distribution gives a general 
impression of the number of stages in 
a problem, of its "complexity," with- 
Out necessarily being correct about 
the detailed structure of the problem. 


A pplication 


Davis (1961) gave three problems 
to 178 undergraduates, and timed 
each subject on each problem. The 
order of the three problems was 
changed from subject to subject. A 
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sizable fraction of the subjects arrived 
at wrong answers on some problems, 
and the wrong answers will be set 
aside; even if there are exactly k 
stages to solution of the problem, 
there is no assurance that it also takes 
k stages to arrive at a wrong answer. 

On the basis of subsidiary measure- 
ments, a relatively constant “reading 
time” could be subtracted from the 
total time consumed on each problem. 
When this was done, means and 
standard deviations of net solution 
time were calculated on each of the 
three problems. 

The first (Rope) problem described 
a prisoner attempting to escape from 
a tower, who found in his cell a rope 
which was half long enough to permit 
him to reach the ground safely. He 
divided the rope in half and tied the 
two parts together and escaped. How 
could this have been? The answer is 
that the rope is divided lengthwise. 
This problem was taken to be a "'one- 
stage" problem in that the solution 
seems to depend on a single idea. 

The net mean time was 131.3 
seconds, the standard deviation was 
115.5, whence the estimated number 
of stages is (131.3)?/ (115.5)? = 1.3 
stages, not far from the one stage 
predicted. In fact, it is possible that 
the time taken to write the solu- 
tion might produce the modest dis- 
crepancy. 

The second (Word Tangle) problem 
was a more complex one: “TE the 
puzzle you solved before you solved 
this one was harder than the puzzle 
vou solved after you solved the puzzle 
vou solved before you solved this one, 
was the puzzle you solved before you 
solved this one harder than this one?" 
Subjects were to answer yes or no and 
why. The answer (yes) requires 
canceling of several mutually defeat- 
ing phrases. Net mean time was 264.8 
seconds, with a standard deviation of 


154.1, so that the estimate of k is 
3.0 stages. 

The third (Gold Dust) problem was 
a more complex version of the stand- 
ard water-jug problem. Given con- 
tainers of 163, 14, 25, and 11 ounces, 
obtain exactly 77. Solutions are to 
subtract, from 163, either 2(25) + 14 
-- 2(11),/0r 25°F 204) a- 3(11), 1e; 
either five or six manipulations. The 
net mean time was 373.0 seconds, 
standard deviation 166.7, whence the 
estimated number of stages isk = 5.0. 

The estimates of number of stages of 
of these problems seem, to the present 
writers, quite in accord with our 
intuitive judgments of number of 
stages. When a group of students, 
after having solved the problems, were 
asked to estimate the number of 
stages in the three problems, their 
mean judgments were 2.8, 4.0, and 6.3, 
respectively. These judgments are 
consistently higher than those ob- 
tained from the distributions of solu- 
tion times, but the correlation is re- 
markable and indicates that our 
judgments as to the relative number 
of stages in the three problems agree in 
substance with judgments of student 
solvers. 

In addition, it was found that the 
gamma distributions (Equation 6) 
give satisfactory fits to the obtained 
distributions of solution times on the 
three problems. The obtained cumu- 
lative distributions of time to correct 
solution of the three problems are 
shown in Figure 1, along with fitted 
gamma distributions. An investiga- 
tion of overall goodness of fit of the 
theoretical gamma distribution was 
made separately for each problem, 
using the Kolgomorov-Smirnov one- 
sample test. The maximum dis- 
crepancies between theoretical and 
observed distributions were all well 
within the .05 acceptance region for 
all three problems. 
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Meaning of the Model for Distribution 
of Solution Times 


The model for the distribution of 
solution times assumes that the times 
subjects take to solve a problem vary 
widely because solving a problem 
involves having only one or a few 
ideas, in sequence; and the time to 
an idea is taken to be a random 
sen iable. This seems a natural con- 
varia M thinking of such a problem 
seque y process as "insightful." The 
iw eme includes other, less 
desse ve, simplifying assumptions. 
attrac pe the stages of a problem are 

One 15 j ily difficult. Another is that 
all eme parameter à applies to all 


the sal the population tested. 


members of 


Cumulative relative frequency of individu. 
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al solvers and fitted gamma distributions 
roblems, respectively. 


"These assu mptionsare probably wrong 
in detail, though small deviations may 
not be easily detectable from the kind 
of data we have used. It is empha- 
sized that they are only simplifying 
assumptions which are made, in this 
model, because we were not able to 
Separate and Measure the stages of 
solution of the problems (because we 
did not, originally, know what they 
might be, and we are not sure now), 
and we did not have good individual 
differences Predictors for our subjects. 
POOLING or ACCOMPLISHMENTS 
IN Groups 
In the us 


1 ual experiment on problem 
Solving by 


Sroups, groups are better 


p—————————— —— ~ 
ioe———/; —Ó ae —n 
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than individuals. If “Eureka” prob- 
lems are used, the group is likely to 
solve successfully as soon as the first 
group member arrives at the correct 
solution. If it is assumed that group 
members are just as effective in groups 
as they would be working alone, then 
it is possible to predict group from 
individual performance, provided that 
individuals and members of group are 
sampled randomly from the same 
population of subjects, and groups are 
formed randomly. 

The one model available for such 
predictions is by Lorge and Solomon 
(1955) who divide the population into 
“solvers” and “nonsolvers, use tests 


+90 


LATIVE FREQUENCY 


CUMULATIVE RE 


of individuals to estimate Pr (the 
proportion of solvers among indivi- 
duals) and then predict that the 
probability of finding at least one 
solver in a group of size r will be 
p; = 1 = (1 — Py)’. Using this 
theory it is possible to transform the 
cumulative distribution of individual 
solvers, which gives Pr as a function 
of 4, into a cumulative distribution for 
groups of size 7, though Lorge and 
Solomon do not do so. 

Lorge and Solomon also note that if 
a problem has more than one part or 
stage, it is possible that a group will 
solve it though no individual in the 
group can solve it. Different group 
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members may be able to Solve differ- 
ent parts, and then pool their abilities 
to obtain an emergent solution. If 
the abilities to solve the various parts 
are equally frequent in the population 
and are independent, this gives a 
decided extra advantage to groups. 
With k equally probable Stages the 
probability of a group solution with 7 
people is: 


P = [1 = (1 = Ppr j [11] 


Equation 11 also can be used to 
generate a distribution of group 
solution times given a distribution of 
individual solving times, for any 


.. alue of k. " : 
oe of the waiting-time model 


for the distribution of solution-times 
vields an estimate of k for a problem, 
from. the distribution of individual 
solution times. That estimate can be 
inserted for k in Equation 11, and 
group performance can be predicted 
from individual Performance. Davis 
(1961) Performed the appropriate 
experiment. In addition to the 178 
individuals, he had 22 four-person 
Sroups work on his three problems. 
Recruitment of subjects, working con- 
uctions, etc., were closely 

comparable for individuals and groups. 
en Equation 11 is used to predict 
group from individual solution times, 
the results are very inaccurate. Fig- 
ure 2 shows the predicted group 
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performance (from Equation 11) and 
the obtained distributions on the 
Rope, Word Tangle, and Gold Dust 
problems for k-1andk-2. The 
actual estimates of k were 1.3, 3.0, and 
5.0, which would obviously give ex- 
tremely inaccurate estimates. 
However, deeper consideration 
shows that the Lorge-Solomon model 
is not the proper one to be used with 
the waiting-time model developed. 
Lorge and Solomon begin by dividing 
the population into solvers and non- 
solvers. To obtain more complete 
predictions, We extended the idea to 
“solvers by time t” and "nonsolvers at 
time ¢,” ‘which seems à legitimate 
extension of the ideas of Lorge and 
Solomon. However, We earlier had 


generated the distributions of solution 
time from the assumption that solu- 
tion was a random event, and all 
subjects had equal probability of 
solution in a given interval. The 
Lorge-Solomon model takes the in- 
dividual performance as à fixed fact 
which identifies solvers and non- 
solvers, assumed to be individuals 
with different abilities. The random 
waiting-time model leads to a dif- 
ferent, combination-of-contributions 


model. 


Combination of Contributions 
Consider a group of subjects none of 
whom misinterprets the problem or 
arrives at a wrong solution." Suppose 
that for each subject the problem 
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involves the same k stages each of 
which has an exponential distribution 
of time-to-completion of Ae", With 
r subjects in a group, it can be shown 
that the distribution of group per- 
formance is given by the gamma 
distribution, 


g(t;r^,k) = 


Th —rht k—1 y) 
E i^ (r£) [12] 
which is precisely the same gamma 
distribution derived for individual 
performance except that the rate (A) 
is increased by a factor of r, the size 
of the group. Part of the proof is 
given. Within a short interval h, 
given that no one has solved a certain 
stage, the probability that some one 
in the group will solve is 1 — (1 — wr. 
As h is taken small, w becomes small, 
in which case 


lu —3)92s1 — 1 dp 


r " r E 
— (ve (Du 


approaches rv. The reason is that all 
terms involving WW, etc. become 
relatively negligible when y becomes 
small. Hence, in the limit, the dis- 
tribution of time to a given stage 
approaches rà. 
This calculation provides a predic- 
tion of group problem Solving if no 
subjects arrive at wrong solutions, 
Preliminary results of an experiment 
by R. Hoppe at Michigan State 
University! give some support for this 
model. Hoppe used Rokeach's (1960) 
"Doodlebug" problem and tested 
individuals, and groups of Size 2 and 3. 
In Iloppe's study the subjects could 
ask the experimenter questions and 
were, in the main, redirected when 
thev had made a serious adds so 
that all subjects and groups either 


à ersonal communication, 
1 Ronald Hoppe, persor 


1961. 


solved the problem correctly or ran 
out of time. The data were in agree- 
ment with the hypothesis that all 
time distributions were of the same 
Shape (it is not possible to make a 
good test of the gamma distribution 
Since a substantial proportion of 
subjects failed to finish) and that the 
rate of solution is roughly propor- 
tional to the number of people in the 
group. 

In Davis’ (1961) experiment the 
subjects received no redirection from 
the experimenter, and about half of 
the individuals arrived at wrong 
answers on the two harder problems 
(Word Tangle and Gold Dust). The 
eflect of wrong answers in the group 
situation is not directly known, since 
Observations did not enable the ex- 
perimenter to tel] which members of 
groups had solved the problem. How- 
ever, two plausible hypotheses can be 
formulated, and they lead to two 
quite distinct Predictions regarding 
Sroup performance, 

Hierarchical model. t ig possible 
that subjects going "off the track," 
referred to here as nonsolvers, are 
nonfunctional in their groups. If so, 
a group with two solvers and two non- 
solvers will solve the problem just as 
rapidly, on the average, as a group of 
two solvers alone, (It is assumed that 
in Eureka problems, groups will not 
agree on a wrong solution.) Non- 
solvers would be nonfunctional if, for 
example, the solvers suppress non- 
solvers and form an intellectual hier- 
archy within the group. 

If a group of size 7 has -1 solvers and 

7? — A subjects who are non- 
Solvers, then by the Hierarchical 
model the distribution of time to 
Solution should be a gamma dis- 
tribution, g(t; Ad, k). Ifa proportion 
@ of individual workers solve the 
Problem, then the probability that a 
8roup of size r has exactly 4 members 
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who will solve is 
P(A) = (4)«a — ay-^, [13] 


a binomial distribution. 

The distribution of solution times of 
a collection of groups will then be 
described by the density function, 


fut = E Pel Ar E [14] 


if the hierarchical model is correct. 

Equalitarian model. Another possi- 
bility is that those subjects who go off 
the track, the nonsolvers, continue to 
take full part in group discussion 
though they do not arrive at solutions. 
This idea leads to the hypothesis that 
the nonsolvers would consume their 
share of the group's time, slowing up 
the other members of the group. This 
would mean that the probability 
that a solver in the group solves in 
unit time is reduced from A to 
[A/(A + B)] when there are A 
solvers and B nonsolvers present. 
The distribution of solution times for 
such a group is, by the argument 
leading to Equation 12, 


Given that the probability of an in- 
dividual's solving is a, the distribution 
of solution times of a collection of 
groups will be described by the density 


function, 
fs (i) 

= L [reti is M 3] [16] 
if ihe equalitarian model is correct. 


The evidence of Davis’ (1961) ex 
periment is that the hierarchical 


model is seriously in érror, and the 
equalitarian model vields predictions 
which are quite close to the data, not 
seriouslv in error. The Kolgomorov- 
Smirnov one-sample test leads to 
rejection of the distribution calculated 
from the Hierarchical Model, for the 
word tangle and gold dust problems, 
at the .05 level of significance. The 
distribution derived from the Equali- 
tarian Model is accepted at the .05 
level for all three problems. Of 
course, with only 22 groups, the sta- 
tistical acceptance is not decisive. 
Predictions from these two combina- 
tion-of-contributions models, along 
with the distributions of group per- 
formance, are shown in Figure 3. 
Subsidiary evidence in the form of 
sociometric choices and communica- 
tion frequencies (Davis, 1961) was 
collected, and the results agree in 
substance with the assumption of an 
equalitarian group structure. 


DISCUSSION 


Structure of Problems 


The estimates of number of stages 
of the Rope, Word Tangle, and Gold 
Dust problems would be very im- 
pressive evidence, except that we have 
no strong independent measurement 
of the number of stages in such prob- 
lems. The intentions and judgments 
are suggestive but not decisive. The 
number of stages in a problem is the 
number of distinct ideas required to 
obtain the solution. The number of 
stages is the number of ideas which 
must be conjoined to obtain the 
solution—the subject must have Idea 
1 and Idea 2 and . . . , etc. If there 
were alternative answers, the subject 
could have Idea 1 or Idea 2 or . . . , 
and these would not be stages. This 
analysis makes it possible to compare 
stage-estimates from the data with the 
logical structure of the problems; and 
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the agreement is good. Unfortu- 
nately, we are not sure how to identify 
“ideas” in the sense used above, hence 
the logical analysis is not sure of its 
elements. M : 
What is needed, obviously, is an 
experimental separation of Stages in 
problem solving. Hoppe's work, men- 
ioned above, attempts such an anal- 
oe A difficulty is that one can 
om separate the stages of a 
n. experimentally, unless he 
aen what those stages are. Verbal 
Landa "of problem solving process 
protoc ful for identifying parts, but 
d pent difficulties of interpreta- 
they 


ative relative fre 
roups of Size 4, 


quency predicted by the Hier- 


(Observed group performance is given 


tion. The writers do not foresee any 
easy and immediate answer to the 
question of problem structure. b 
tages versus parts. In the special 
model above it Was assumed that the 
Stages are to be accomplished in 
Sequence. If there are simply k parts 
which must be accomplished, in any 
order, then the distribution of times 
may be'changeq ^ jy the subject i$ 
thought to Work on only one stage at 
a time, Solving the Stages in some 
irregular order, then the gamma dis- 
tribution model applies. However, 
if ideas for different parts of the 
Problem can arise freely and in- 
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dependently, a new distribution arises. 
Solution time of any part will be 
exponentially distributed, but until 
the subject has solved the first part 
there are k parts which may be solved. 
Hence the rate until the first part- 
solution is kX. After one part is com- 
pleted, there are k — 1 parts left and 
the rate until the second part is 
(k — 1)d. Finally, when there is only 
one part left, the rate is à. These 
“rates” can be defined as above if the 
distribution of times until the next 
stage is assumed to be exponential.° 
The resulting distribution 1$ rather 


5'Dhe writers ar 
Rubin for the formul 
mathematical problem. 


e indebted to Herman 
ation and solution of this 


complicated, but its mean and vari- 
ance can be developed easily. Since 
the mean of each of the waiting times 
is independent of the others by hy- 
pothesis, the total mean is the sum of 
k waiting times with means TN 
1/ (2A), +++, 1/ (kà). Hence the mean 
total time to solution, E(T), is 


E(T) 7 (1/9 
X (1+1/2+1/3+---+1/k) [17] 


By a similar argument, the variance 
of total times is 


Var (T) = (1/X) 
x[t4- (1/2)?+ (1/3) + 
E] [8] 
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urrently) 


AGES 


Number ie a uiii 
of parts 
or stages! ports | Stages! Parts Stages| Parts | Stage 
1 [1.00 | 1.00 | 1.000| 1.000! 1.000! 1.00 
2 [1.50 | 2.00 | 1.108] 1.414| 1.800] 2:00 
3 |1.83 | 3.00 1.117] 1.731| 2/469| 3.00 
4 |2.08 | 4.00 | 1.119) 2.000| 3.019] 400 
s |2.28| 5.00 | 1.210| 2.238] 3.562| 5.00 
6 12.45 | 6.00 | 1.221] 2.452] 4.025) 6.00 
7 2,59 | 7.00 | 1.229] 2.642) 4.447) 7.00 
& [2:72 | 8.00 | 1.236] 2:828| 4.836| 8,00 
5 283 | 9.00 | 1.241] 3.000] 5.198| 9.00 
io [2:93 [10.00 | 1.245] 3.161| 5.536 10.00 
| 


Table 1 shows how mean and 


standard deviation, and also the value 
of p?/o2, Progress with more and 
wore parts. Comparable values for 
Stages,” ie, parts which must be 
Solved in Succession, are shown for 
contrast, 

Possible effects of wrong solutions. 

€ Equalitarian model for predicting 
group from individual performance 


ay distort the distribution 
to correct solution. |n 
the number of stages tO 
d working out the waiting- 


estimating 
Solution an 


" — X — — 
OE 


n—————À— 
= J—p————( 


O_o E 


PROBLEM SOLVING BY INDIVIDUALS AND GROUPS 


time model, subjects who arrived at 
wrong solutions were eliminated from 
the data. This seems reasonable, for 
the number of stages to a wrong 
solution may be different from the 
number of stages to a correct solution. 
However, this conditional distribution 
of times (times to solution given that a 
correct solution is reached) may be 
misleading. The difficulty is that a 
subject who happens to take a long 
time to get the correct answer has 
an increased vulnerability to error, 
whence long times to correct solution 
may be systematically less frequent 
when errors occur than they would be 
if the whole distribution were known. 

The difficulties can be studied by 
investigating the properties of random 
systems with more than one outcome. 
The most general model available 
to the present writers is that proposed 
by LaBerge (1960)9 which permits 
any of several responses which involve 
by Rs... stages respectively. La- 
Berge derives the distribution of times 
to completion of the kı stages of Out- 
come 1 provided that Outcome 1 
occurs. 

One special model may be 
the “misreading” model, in which it 
is supposed that the subject may, at 
the very beginning, make a choice 
which leads to a wrong solution. The 
subject either enters into a process 
which leads to success (with the 
gamma distribution of Equation 8), 
or onto some other which leads to 
failure. The conditional distribution 
of times to correct solution, given that 
a correct solution and not an error is 
attained, is the original gamma dis- 
tribution. 

A third possibility is that the 
Subject may at any time before solu- 
tion make a decision which leads to a 


called 


LaBerge, D. A statistical theory of 
iscrimination behavior. (Prepublication 
draft, December 1960.) 
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wrong answer. Consider that this 
wrong decision is a unitary event 
which has a constant probability of 
happening at any time, given it has 
not already happened, up until the 
problem is solved. Then the acci- 
dental event will have an exponential 
distribution. The density (prob- 
ability per time interval) of solution 
of the problem will be the product of 
the original gamma density, g(t; X, k) 
and the probability that the acci- 
dental event has not occurred. If the 
constant probability-per-unit-time or 
rate of the accidental event is a, the 
probability that it has not occurred by 
time ¢ (provided the problem is not 
solved before f) is c-**. The condi- 
tional density of solution at time £, 
given that a correct solution is at- 


tained, is 


f(E; N, a, k) 


] se V, Bed 
0 


3-5 g- O9 t (Ar 


The numerator is simply 


etad (saz) Gu 


and the denominator is 


À e pee 
(3) Í g(t; Ma, E)dt 


ner 


The step from Equation 20 to Equa- 
tion 21 is accomplished by noting that 


g(t;\ + a, k) 
A+a 
(k — 1)! 
A + aie e 


[19] 
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If this is multiplied by [A/(^ day 
one obtains the numerator of Equa- 
tion 20. am Bu 

Hence the conditional distribution 
of solution times is 


fü ;Nok)-—g;A-rak) [23] 


a gamma distribution with the original 
number of stages, &, but with rate in- 
creased to A + a. The probability of 
a correct solution is [A/ (4 + a) Jt. 

In summary, three effects of wrong 
solutions were mentioned ; they might 
distort the distribution from a gamma 
to some other distribution (if the 
subject works concurrently on stages 
of several different processes leading 
to several answers), it might leave the 
conditional distribution of solution 
times unchanged (if errors arise only 
through an initial wrong decision such 
as misreading the problem), or it 
might leave the conditional distribu- 
tion a gamma, still with $ stages, but 
increase the rate of solutions, (if entry 
onto an erroneous path is thought of 
as a unitary random event.) Of these, 
the third seems most in accord with 
the problem solving situation in 
Davis’ (1961) experiment. 

Unhappily, we do not have the data 
needed to decide which of these possi- 
bilities, if any, occurred in the Davis 
(1961) experiment. As we under- 
stand it, the LaBerge formulation 
could be fit to the data exactly only 
if we knew what all the wrong solu- 
tions were and could estimate the 
number of stages and the rate pa. 
rameters of each. Davis’ experiment 
was not designed to facilitate such an 
investigation, and the necessary anal- 
ysis is not feasible. Since LaBerge’s 
model involves parameters we cannot 

estimate, it cannot be compared with 
the other models validly. The mis- 

ding model and the accidental 
e t model do not require analysis of 


even inner structure of erroneous 


the 
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solutions, hence can be applied to the 
present data. Since both lead to 
gamma distributions of correct so- 
lution, both agree with Davis’ re- 
sults. Only a detailed qualitative 
analysis of protocols, or a technique 
for eliminating wrong solutions with- 
out otherwise disturbing performance, 
would indicate which model is correct. 
In fact, some subjects might misread 
and others err through an accidental 
random event, and the conditional 
distribution of correct solution time 
would still be a gamma distribution. 
This theoretical result increases the 
Plausibility of the gamma distribution 
as an hypothesis by showing that it 
might, under reasonable conditions, 
characterize data from problems with 


quite different frequencies of wrong 
solution. 


OTHER Moprrs FOR OTHER 
KINDS or Groups 


The predictions of group perform- 
ance in Davis’ experiment were for 
ad hoc groups of equivalent members; 
with no discernable social inequalities. 
Furthermore, the nature of the prob- 
lems was such that ideas were pooled 
with little controversy over the cor- 
rectness of a correct proposal. In à 
"team" it sometimes happens that all 
members must solve their parts of the 
Problem before the group succeeds. 
Then the probability of group solution 
is the joint Probability that all mem- 
ers have completed their tasks bY 
time £ If the individual assignments 
are unitary problems with the same 
difficulty, and all members of the 
group can work concurrently (as !" 
Servicing .an automobile) then the 
model leading to Equation 17 and 1 
should describe the Situation. If one 
Stage of the process must wait upon 
Completion of another stage, then 
combining the individua] contribu- 
tions can lead to a gamma distrib" 
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tion. lf solution of the problem is 
mainly a matter of decision rather 
than invention, then the probability 
of decision by time 7 may be the 
probability that all group members 
accept the same idea at that time. 
Changes of opinion, persuasion, and 
the invention of compromises are the 
events of primary interest, and the 
main results of the present paper do 
not apply. 

These remarks emphasize that the 
mathematical theory used in this 
paper is not a general theory of group 
problem solving, but is a careful and 
precise formulation of some hypotheses 
about one particular kind of group 
problem solving. The hypotheses de- 
pend both upon the constitution of the 
groups and the nature of the problems. 


RELATION TO OTHER AREAS 
or PSYCHOLOGY 


The mathematical forms discussed 
in this paper have applications else- 
where in psychology. The notion that 
an individual stage of problem solving 
will have an exponential distribution 
of solution time, is essentially the 
same as the hypothesis that a unitary 
learning event will occur on a geo- 
metrically distributed trial. Both 
arise from the statement that the 
event has constant probability of 
occurring, until it does occur. In 
learning theory, this is the hypothesis 
of "all-or-nothing" learning (Bower, 
1962; Estes, 1960; Restle, 1961b, 
1962). A truly unitary process, in- 
divisible and independent of other 
processes, should have a constant 
probability o^ occurrence. If an event 
is not unitary but a compound, then 
every effort should be made to tease 
apart the stages involved in it. 
Hence the geometric and exponential 
distributions (for trials and continu- 


en 
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ous time, respectively) give a theory 
for any event which can properly be 
left unanalyzed. 

The only compounds which do not 
require detailed analysis into parts are 
those made up of equivalent parts. 
A beam of light can be thought of as 
a compound of unitary emissions of 
photons, but the only interesting 
question about such a beam is the 
number of photons, since separate 
photons are all alike. Thus, an ideal 
compound event must be one made up 
of equivalent, identically distributed 
parts. The gamma distribution de- 
scribes compounds of the type used in. 
this paper. Previous applications of 
gamma distributions include Bush and 
Mosteller (1955, Ch. 14) on the time 
rats take to traverse a runway, and 
Restle (1961a) on simple reaction 
time. 

Similarly, the simplest kind of 
model for comparing group with in- 
dividual performance is one which 
assumes that people in a group are, in 
all ways possible, just like people 
working alone. In a nonevaluative 
sense one can say that the Lorge- 
Solomon models, and the variations 
given in the present paper, are all 
versions of this ideal hypothesis. 

Any deviations between model and 
data suggest the action of factors more 
complex (and perhaps more interest- 
ing) than the very simplest possi- 
bilities discussed here. For example, 
since Davis' groups did not reach the 
level predicted by Lorge-Solomon 
models, it became possible to intro- 
duce the hypothesis that individuals 
who have entered on a wrong inter- 
pretation of the problem take up their 
share of group time and interfere with 
the progress of others. Discrepancy 
from one model became a measure- 
ment of a previously unmeasured 


factor. 
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SUMMARY 


A waiting-time model was devel- 
oped to give a distribution of times to 
solution, for word puzzles. The dis- 
tribution depends upon the number of 
stages involved in solving the problem. 
Experimental results showed that 
the theoretical distribution (gamma 
distribution) gives a good fit to the 
data, and the estimated number of 
stages agrees well with intuition and 
judgments. A theory of "pooling of 
contributions” was developed to de- 
scribe the superiority of groups over 
individuals in solving word puzzles, 
Experimental results suggested that 
subjects who have misunderstood the 
problem and will arrive at wrong 
answers, nevertheless consume their 
share of group time. With this addi- 
tion, the pooling of contributions 
theory accurately predicts group prob- 
lem solving data. Some alternative 
models, resulting from different prob- 
lem structures and different kinds of 
groups, were discussed, and the wait- 
ing-time theory was shown to be 
related to the concept of “all-or- 
nothing" learning. 
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THE MOLECULAR BASIS OF MEMORY AND LEARNING 
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In recent papers, Gaito (1961) and 
Hyden (1961) have suggested that nerve 
activity results in a rearrangement in 
the sequence of monomers of certain high 
molecular weight polymers (nucleic acids 
and proteins) found in brain cells, or in 
the synthesis of additional polymers of 
novel sequences, and that this rearrange- 
ment or synthesis is the molecular basis 
of memory. It is the purpose of this note 
to show that the only mechanisms 
whereby such rearrangements or syn- 
theses could occur predict chemical brain 
changes incompatible with present ex- 
perimental evidence, and, moreover, that 
current biochemical knowledge of nucleic 
acids renders the hypothesis almost cer- 
tainly false. The evidence brought for- 
ward in support of polymer sequence 
rearrangement is then shown to be com- 
patible with a second hypothesis that is 
not contrary to current biochemical views 
on biopolymers. 

There are three main types of poly- 
mers? in all cells, including neurones. 
The deoxyribonucleic acids (DNAs) are 
confined to the nucleus and, in the form of 
nucleoproteins, are the major constituents 
of the genes. There are two types of 
ribonucleic acids (RNAs). A soluble 
form (messenger-RNA ) is apparently 
synthesized in the nucleus and then trans- 
ferred to the cytoplasmic ribosomes. 
These structures contain insoluble RNAs 
and are the site of protein synthesis. 

The interrelations of DNA, RNA, and 


protein are relatively simple. Every cell 


contains at least one type of DNA mole- 
cule for every gene. These molecules are 


1Details of the biochemistry of DNA, 
RNA and their relation to protein biosyn- 
thesis can be found in numerous publications, 


e.g. Abrams (1961), Leslie (1961). 


chemically identical to the DNAs of the 
parent cells. Each DNA molecule is a 
long, unbranched chain of nucleotides, 
which usually occurs chemically bonded 
to a second molecule in the form of a 
twin helix. Replication of a DNA mole- 
cule occurs only at cell division when the 
two components of a helix separate and 
each serves as a template for the synthesis 
of a new polynucleotide to form two new 
helices. 

As these processes occur only at cell 
division, it is clear that the DNA mole- 
cules in the nucleus of a neurone are 
permanent structures, and there is no 
experimental evidence of any significant 
change in quantity with time. The base 
sequence of any particular DNA molecule 
is determined by the base sequence of the 
template on which it was formed (Korn- 
berg, 1960). It has been demonstrated 
that no polynucleotide synthesis occurs in 
the absence of a template (Kornberg, 
1960). 

This evidence rules out changes in the 
base sequence of DNAs as molecular 
“memory banks." There is no way in 
which electrical changes in the cell walls 
of neurones could interfere with the proc- 
ess of polynucleotide synthesis to give 
new base sequences. DNA is formed 
only on a template, spontaneous coupling 
of nucleotides does not occur. Hence, 
even if DNA molecules of the nucleus 
were fragmented by an electrical impulse, 
they would not recombine into polynucleo- 
tides of different base sequence. 

The only known mechanism for alter- 
ing the base sequence of a DNA molecule 


2 Total brain DNA reaches a minimum in 
out age 30 years, then steadily in- 
Changes in individual brain cell 
at present un- 


man at ab 


creases. S 
DNA concentrations are 


investigated. 
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is by reaction with a mutagenic agent. 
These are compounds which chemically 
alter the purines and pyrimidines of the 
polynucleotide (see, for example, Freese, 
1961). The effect of a mutagen is to 
alter the base sequence not by a rear- 
rangement of the existing bases, but by 
the conversion of some of the bases in situ 
on the polymer molecule to other related 
compounds, not normally found in DNAs. 
It could be hypothesised that nerve im- 
pulses produce chemical mutagens in vivo. 
However, this cannot be true for no un- 
usual bases have been found in brain 
DNAs, nor incidentally, in brain RNAs. 

Nuclear DNAs continually serve as 
templates for the synthesis of messenger- 
RNA. The base sequence of the DNA 
template determines the base sequence of 
the messenger-RNA formed on it (Weiss 
& Nakamoto, 1961). Again, there is no 
opportunity for electrical impulses to in- 
terfere with the process to give a messen- 
ger-RNA of base sequence undetermined 
by its DNA template. The steric rela- 
tionships necessary for synthesis on the 
template are absolutely specific for par- 
ticular bases (Abrams, 1961), 

After transfer to the ribosomes, the 
RNA serves as template for protein bio- 
synthesis. The sequence of amino acids 
in any particular type of cell protein (in- 
cluding enzyme proteins) is determined 
directly by the base Sequence of the RNA 
on which it forms, and, of course, indi- 
rectly by the base Sequence of the DNA 
on which the RNA was formed (Speyer, 
Lengyel, Basilio, & Ochoa, 1962: Sueoka, 
1961). 

It is quite clear that the only way in 
which electrical impulses could disrupt any 
of these template mechanisms to give al- 
tered monomer sequences is by the forma- 
tion of a mutagen. The absence of unusual 
bases in brain nucleic acids precludes this 
possibility. Moreover, even if a change in 

olymer base sequence was caused in a 
p cd by some outside influence, the 
cok would probably not have the re- 
A desired by Gaito (1961). Thus, 
= he alteration in monomer sequence 
m ers one, which would certainly 
bg et for the cell due to the pro- 


duction of foreign protein, the only ef- 
fect would be some small change in the 
amino acid sequence of a protein. To be 
of any consequence this would have to 
be an enzyme protein. If the change oc- 
curred at the active center, the enzyme 
would lose its catalytic properties and be 
useless to the cell. If the change oc- 
curred elsewhere in the protein molecule 
the chances are greatly against any sig- 
nificant alteration in properties (Fincham, 
1960). 

In Summary, one may say that the hy- 
pothesis of brain polymer sequence altera- 
tion as the molecular basis of memory is 
unlikely because the known template 
mechanisms of polymer biosynthesis pre- 
clude any such changes, and even if such 
changes did occur they would, if large, 
lead to a complete disruption of cell ac- 
tivity, or, if small, to no significant 
change, 

Experimental evidence in support of 
the polymer rearrangement hypothesis of 


memory has been published by several 
workers, 


total brain RNA occur with age (Cam- 
eron & Solyom, 1961), that the adminis- 
tration of structural analogs of purine 
bases affects rat maze learning (Dingman 
& Sporn, 1961), that treatment with ribo- 


1961; Hyden, 1959, 1961). 

Considered as a whole, this evidence 
definitely demonstrates a role for RNA 
in the molecular Processes basic to mem- 
ory and learning, But, on logical 
grounds, it is necessary first to demon- 
strate that this role is not connected with 


the well-established function in protein 
biosynthesis, before 
unction involyj 


grounds. Tti 


ing discussion to propose that the mo- 
lecular basis of memory involves changes 
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in enzyme concentration in brain cells, 
and that these changes necessarily involve 
the observed alterations in RNA metab- 
olism. 

It is a well-known principle of genetics 
that all somatic cells contain the same 
genes. Each gene (consisting of DNA) 
controls the synthesis of a particular en- 
zyme (Leslie, 1961). Hence every body- 
cell has the potential to make the same 
set of enzymes. But it is quite clear that 
there are large enzyme differences be- 
tween cells. Some lack enzymes pos- 
sessed by others, while the concentration 
of a particular enzyme varies widely be- 
tween cells of different tissues (Dixon & 
Webb, 1958). It is apparent that the 
genes of some cells are inactive; in the 
sense that they are not participating in 
protein biosynthesis to any significant 
extent: while others are not functioning 
at their maximum rate. There must exist 
within cells a mechanism (or mecha- 
nisms) to initiate and control the rate of 
DNA, messenger-RNA, and messenger- 
RNA-protein syntheses. The chemical 
basis of these mechanisms is still obscure, 
but many enzymes are known to be in- 
duced by the presence of their substrates 
(Dixon & Webb, 1958). It is possible 
that the presence of substrate within a 
cell initiates synthesis, while the concen- 
tration of substrate determines the rate 
at which a particular enzyme is produced. 
This mechanism could be basic to the 
biochemical basis of memory and could 
operate in the following manner. 

The synapses of the brain are polyfunc- 
tional, in that several axons terminate 
at the dendrites on any particular neu- 
rone. For this neurone to transmit an 
impulse it must be stimulated. The stimu- 
lus, which is chemically transmitted, can 
come either from a single axon, or from 
combination of axons discharging to- 


a 
gether. Whether or not the impulse is 
transmitted depends upon whether the 


f released transmitter sub- 
he critical level to stimu- 
late the next neurone. Now, the amount 
of transmitter substance manufactured 
and stored by a neurone must depend 
upon several factors: including the activ- 


concentration o: 
stance reaches t 


ity within the cell of the biosynthetic en- 
zyme system for the substance. If the 
enzymes of this system are inducible, the 
exchange of metabolites and demands for 
transmitter substance on repeated stimu- 
lation will increase the neurone's ability 
to manufacture the substance. Hence on 
the first few stimulations a neurone may 
not release sufficient transmitter substance 
to produce a postsynaptic response, but 
the amount of substance released on sub- 
sequent occasions will be sufficient. 

If this hypothesis is incorporated with 
the concept of polyfunctional synapses 
and interconnecting neurone pathways in 
the brain, it becomes clear that an in- 
creased amount of transmitter substance 
released by any axon following repeated 
stimulation could be the major factor in 
establishing and maintaining the particu- 
lar nervous mechanisms of memory and 
learned behavior. 

Experimental investigation of the hy- 
pothesis could be attempted by a determi- 
nation of whether the enzyme systems for 
the biosynthesis of nerve transmitter sub- 
stances are inducible, and whether there 
is any increase in the amount of trans- 
mitter substance released by an axon 
following repeated stimulation, The re- 
cent reports of variations in cholinesterase 
activity with age, strain, individuals, and 
after learning (Aprison & Himwich, 
1954; Bennett, Krech, Rosenzweig, Karls- 
son, Dye, & Ohlander, 1958; Bennett, 
Rosenzweig, Krech, Karlsson, Dye, & 
Ohlander, 1958; Krech, Rosenzweig, & 
Bennett, 1956, 1959; Krech, Rosenzweig, 
Bennett, & Longueil, 1959) in the brain of 
rats, could well be changes induced by in- 
creased amounts of the transmitter sub- 
stance acetylcholine, which is being 
formed by neurones in larger quantities 
due to increased choline acetylase activ- 
ity as suggested by the above hypothesis. 

Tt will be seen that the hypothesis sug- 
gested here, that the molecular basis of 
memory and learning is basically induced 
changes in the enzyme systems involved 
in the formation and degradation of nerve 
transmitter substances, unlike the hy- 
pothesis of nucleotide sequence rearrange- 
ment, makes no proposal contrary to 
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biochemical views. Moreover, 
ue. ed concentration of specific en- 
zymes necessarily implies increased ac- 
tivity of the RNA-dependent biosynthetic 
systems for these enzymes. In fact, cer- 
tain definite changes in RNA metabolism 
are predicted by the hypothesis. These 
are outlined below. : . 

Increases in enzyme concentrations im- 
ply increases in RNA. Considering the 
central nervous system as a whole, learn- 
ing implies the continual establishment of 
new interconnecting neurone pathways. 
Hence a continual increase in brain RNA 
with age is to be expected. "The increase 
in any particular neurone will occur rela- 
tively rapidly, but the increase within any 
tissue mass will be slow. 

It is clear that the administration of any 
compound to an animal that will inter- 
fere with the RNA-dependent biosyn- 
theses in neurones will affect learning 
and memory. Azaguanine is such a com- 
pound. It is a structural analog of 
guanine (a purine base that is a major 
constituent of both DNA and RNA). It 
is to be predicted that the presence of this 
analog in brain cells will disrupt the in- 
ductive increase in enzymes, due to an 
inhibition of the DNA- and RNA-depend- 
ent processes necessary for enzyme 
biosynthesis, 

The hypothesis predicts increases in 
those molecular species of RNA required 
for the biosynthesis of enzymes involved 
in the metabolism of nerve transmitter 
substances. Other types of RNA will not 
be altered. Now every molecular species 
of RNA probably has a different ratio 
of nitrogenous bases. Consequently, prior 
to induction the total RNA of a cell will 
have a particular base ratio that is the 
mean of all the individual RNA base 
ratios. After induction some types of 
RNA have increased. Hence the base 
ratio of total RNA will now be different. 

It will be seen that the changes in RNA 

metabolism predicted by the enzyme in- 
duction hypothesis are just those that 
have been experimentally reported and 
claimed to support the nucleotide sequence 
hypothesis. Tt is clear that this evidence 


THEORETICAL NOTES 


cannot be used to decide between these 
two alternate hypotheses. ] 

One further piece of experimental evi- 
dence will be considered. It has been re- 
ported by Corning and John (1961) that 
planarians conditioned to avoid light can be 
cut in two and that after regeneration both 
halves retain the conditional response. 
However, if the planarians are allowed 
to regenerate in water containing small 
amounts of ribonuclease (an enzyme cata- 
lyzing the hydrolysis of RNA), only or- 
ganisms regenerated from “heads” retain 
the response. 

These experiments are difficult to in- 
terpret on several grounds. First, ribo- 
nuclease is a protein and may not diffuse 
into the planarians’ cells, or may not diffuse 
equally into all. Second, there is no evi- 
dence as to the quantity and distribution 
of neurones involved in the response. 
Thus on the enzyme induction hypothesis, 
assuming that the response involves some 
interaction between the head ganglia and 
the somatic neurones, certain levels of 
RNA are necessary to maintain induced 
levels of transmitter Substances in particu- 
lar neurone pathways. On treating the 
"tail" with ribonuclease these essential 
RNA levels are reduced to the preinduc- 
tion concentrations. Hence the organism 
regenerated from this portion lacks the 
conditioned response. On either hypoth- 
esis, it is necessary for the ribonuclease 
not to destroy the neurone RNA in the 
head section. Consequently, this experi- 
ment also fails to provide an experimental 
test to distinguish between the two 
hypotheses, 

In summary it may be said that the nu- 
cleotide rearrangement hypothesis of 
memory postulates undemonstrated prop- 
erties of nucleic acids that seem incompat- 
ible with current knowledge, while the 
hypothesis fails to give any satisfactory 
account of how RNA-stored information 
could influence the nervous mechanisms 
of memory. The experimental evidence 
Produced in favor of this hypothesis is 
equally compatible with the hypothesis 
that memory is basically dependent upon 
neurone pathways maintained by high 
levels of transmitter substances due to 
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induced biosynthetic enzymes. This en- 
zyme induction hypothesis is also com- 
patible with reported age and learning 
increases in brain cholinesterase levels: a 
phenomenon not predicted by the nucleo- 
tide rearrangement hypothesis. 
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THE UNTENABILITY OF LUCE'S PRINCIPLE 


WILLIAM W. ROZEBOOM 


Wesleyan University 


In the course of his instructive re- 
searches in the mathematics of psycho- 
physical scaling, R. Duncan Luce (1959a; 
1959b, p. 29) has recently proposed a 
methodological principle which, if true, 
would have revolutionary import not 
merely for psychophysics, but for em- 
pirical and theoretical research in all 
branches of science. Moreover, while 
Luce’s Principle has met with some- 
thing less than enthusiastic acceptance 
in all quarters (cf. Luce, 1959b, p. 90; 
Mosteller, 1958, p. 287), others have 
begun to draw upon it as a basis for 
their own work (cf. Galanter & Messick, 
1961). Such applications are decidedly 
premature, however, for the Principle's 
methodological status is dubious at best. 
As is inevitable for a provocatively novel 
hypothesis, certain ambiguities and un- 
resolved difficulties reside in Luce’s 
initial account, and when these are made 
explicit, it becomes highly problematic 
whether an interpretation can be found 
for the Principle which escapes being 
either vacuously true or empirically false. 

Operating within the context of Ste- 
vens’ theory of scale-types (e.g., Stevens, 
1951); Luce has proposed that the class 
of transformations defined by a scale- 
type should impose limitations on the 
mathematical form of the functional 
relations in which a variable of that type 
can participate. Specifically, it is postu- 
lated (Luce, 1959b) that: 


A substantive theory [i.e., an assumed law] 
relating two or more variables and the meas- 
urement theories Li.e., assumptions about the 
scale-types which are appropriate] for these 
variables should be such that: 


1. (Consistency of substantive and measure- 

it theories) Admissible transformations 

fie. transformations belonging to the class 

es 

atro" also like to voice my reservations 

umber of the conclusions of this 

n Tonei the substance of these 
await another occasion. 


abot 
theory. 
doubts must 


defined by the scale-type] of one or more of 
the independent variables shall lead, via the 
substantive theory, only to admissible trans- 
formations of the dependent variables. 

2. (Invariance of the substantive theory) 
Except for the numerical values of parameters 
that reflect the effect on the dependent 
variables of admissible transformations of the 
independent variables, the mathematical 
structure of the substantive theory shall be 
independent of admissible transformations of 
the independent variables (p. 85). 


Before putting his Principle to work, 
Luce comments that "one can hardly 
question the consistency part of the 
principle," but adds that “the invariance 
part is more subtle and controversial." 
I shall propose, to the contrary, that if 
this verbal statement of the Principle is 
taken at face value, it is the ‘“consist- 
ency" part which is Strong and corre- 
spondingly untenable, whereas the "in- 
variance" part is a weakened version of 
the former which need be no more than a 
mathematical truism. 

What can be meant when, in the con- 
sistency part of his Principle, Luce 
speaks of a transformation of an in- 
dependent variable as leading to, via the 
substantive theory, a transformation of 
the dependent variable? The wording 
suggests that the transformation of the 
one somehow compels or induces a trans- 
formation of the other, an interpretation 
which is strengthened when, in sub- 
Sequent illustration, Luce refers to a 
transformation being "effected on" the 
dependent variable. This cannot be a 
mathematical compulsion, however, for 
there is nothing about a lawful relation- 
Ship which requires us to rescale one 
variable when transforming another. 
Thus given the law, “The area of a circle 
in Square inches is 7 times its squared 
radius in inches,” if I elect to scale length 
in feet rather than inches, my law simply 
becomes “The area of a circle in square 
inches is 1447 times its squared radius 
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in feet." (If in this example the re- 
scaling of length is felt—as it should not 
be—to require a corresponding rescaling 
of area, we could instead take, e.g., the 
relation between a person's height and 
his age.) The apparent meaning of the 
"consistency" stricture comes clearer 
upon examination of Luce's proffered 
illustration: 


Suppose it is claimed that two ratio scales 
are related by a logarithmic law. An ad- 
missible transformation of the independent 
variable x is multiplication by a positive 
constant k, i.e., a change of unit. However, 
the fact that log kx = logk + log x means 
that an inadmissible transformation, namely, 
a change of zero, is effected on the dependent 
variable. Hence the logarithm fails to meet 
the consistency requirement [italics added] 
(1959b, p. 85). 


So worded, the example does not make 
explicit what happens to the dependent 
variable; however, let y be a function of 
x by the relation y = log x, and let 
x! (=aer kx) be the transformed scale of 
the independent variable. Then the 
cited mathematical fact about log kx 
implies that log x’ = log k + log x, or 
y + log k = log x’; so if we wish to find 
a rescaling, y’, of the dependent variable 
which stands in the same relation to x' 
as that in which y stands to x, it has to 
be y’ = y + log k, which is inadmissible 
(says Stevens) for a ratio scale. As 
demonstrated by this example, therefore, 
what the consistency injunction entails 
is that if an admissible transformation is 
made of the independent variable, then 
there must be an admissible rescaling of 
the dependent variable which leaves the 
mathematical expression of the law in- 
variant—i.e., if y =@x, and x’ is an 
admissible rescaling of x, then there is an 
admissible rescaling, y’, of y, such that 
y! = gx’. Other than this, it is difficult 
to see what sense could be made of the 
notion that transformation of the one 
variable "effects" a transformation of the 


other. . 

Before defending this interpretation 
which also needs to be 
considered, it is useful to state explicitly 
just what does happen to a lawful 
relationship, y = $1, under transforma- 


against another 
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Un 


tion of its variables? Let T and U be 
identity preserving (i.e., one-one) trans- 
formations, admissible or otherwise, 
applicable to x and y, respectively. Then 
if x is transformed into x’ by T and y 
is transformed into y’ by U, we have 
x’ = Tx and y' = Uy, or x = Tx’ and 
= U-!y. Hence the substantive 
theory 


Ne: 


y-éóx 
rewritten in terms of x’ and y’, becomes 


UAy = Tox! 
or 


y = USTs! uJ 


Morcover, if it is also the case that 
y' =x’, it follows from Equation 1 
that x’ = UdT x’, or Tx = Uóx. 
Let Ta be an admissible transformation 
of x, and Ug, similarly, an admissible 
transformation of y. Then the consist- 
ency claim of Luce's Principle apparently 
requires that if y = x, the function @ 
must be such that for any admissible 
transformation Ta, there corresponds an 
admissible transformation Ua such that 
for all values of x, 


Tax = Ux [2] 


(It is easily seen that if the inverse of à 
exists, this condition on ¢ is satisfied if 
and only if ọTap™ is an admissible 
transformation of y.) 

It has been proposed to me that what 
Luce intended the consistency part of his 
Principle to assert is only that if an 
admissible transformation, x’ = Tax, is 
made of the independent variable in 
y =x, then there must exist some 
relationship ¢’ and an admissible trans- 
formation, y’ = Uay, of the dependent 
variable such that y' = $'x'. Then to 
derive Equation 2, the invariance part of 
the Principle must be construed to 
stipulate that Ua can always be chosen 
to set $^ = $. But if this is all that the 
consistency claim were to imply, then 
it would have no restrictive force what- 


? Since x may be construed as a vector, the 
analysis which follows applies to multivariate 
relationships as well as to laws involving only 
one independent variable. 
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soever—we can always take $' = T~, 
in which case y requires no transforma- 
tion at all—and hence a case could never 
arise, contrary to Luce s example cited 
earlier, in which the consistency require- 
ment is violated. Moreover, as the in- 
variance part of the Principle is worded, 
it is not strong enough to authorize 
Equation 2 because it insists not on strict 
invariance of ó under admissible trans- 
formations of the variables, but only on 
an invariance of some structure of $. 
But the mathematical form (i.e., struc- 
ture) of a function is hopelessly non- 
unique, and unless additional, as yet 
unspecified, conditions are imposed on 
this to-be-invariant form, we may take it 
to be a form common to any function of 
the class { Usp Tz) —which then leaves 
the invariance condition without any bite. 
In any event, that Condition 2 is, in 
fact, the mathematical nugget which 
Luce (1959b) extracts from the ore of 
his Principle is demonstrated by his 
Table 1. Each entry in the “Functional 
Equation" column may be seen to be an 
instance of Equation 2. In view of the 
stiflingly strong consequences (e.g., Luce, 
1959a, Table 2) of Luce's Principle, what 
can be said to justify it? Despite the un- 
deniable mathematical charm of Equa- 
tion 2, it seems to me that the answer 
must be an uncompromising "nothing 
whatsoever." It will be noticed that 
Luce himself does not actually present 
any argument to support his Principle, 
but in effect merely invites the assent of 
the reader's intuition. But intuition may 
well be gulled by the ambiguities in the 
verbal statement of the Principle into 
acceding to an innocuous reading of it 
which then imperceptibly slides into a 
not-so-innocuous version. This is es- 
pecially likely for the consistency part of 
the Principle, which Luce insists can 
hardly be questioned even while at the 
same time holding it to imply at least 
some restriction on the form of a law. 
(But it has lend Ween end that the 
istency claim is either vacuous or 
dorus tie fui burden of Equation 2 by 
itself.) In any case, in my own methodo- 
logical research I have P e many 
initially self-evident theses wither away 


under careful probing to have much 
confidence in methodological intuition. 

Pending cognitive support for Luce's 
Principle, then, how well does it stand 
the test of applicability to established 
laws? "Well, as Luce himself points out, 
it is not difficult to find laws (see example 
below) which do not conform to it. 
Luce's comments on this faux pas carry 
the discussion into a rather different 
problem of vast proportions, but before 
pursuing his lead, we may pause to 
raise an eyebrow at the methodological 
status of a "principle" which is not 
binding. To be sure, the verbal state- 
ment of Luce's Principle does not say 
that all substantive laws do satisfy the 
Principle, but only that they should 
doso. Itis most unlikely, however, that 
What was intended by this wording is 
merely that conformity to the Principle 
is desirable. Science is concerned with 
what is, not with what ought to be, and 
if the Principle is to have any scientific 
significance, it must be that Condition 2 
Says something about the forms which 
laws are in fact able to assume, and in 
light of which we may limit the forms 
to be considered when Speculating about 
laws hitherto undiscovered. But if there 
are perfectly good laws which violate 
Equation 2, then it is difficult to see how 
the Principle could ever by itself impugn 
the legitimacy of any assumption about 
the form of a problematic law, 

If Luce's Principle is to have any 
methodological force at all, therefore, 
there must exist additional criteria by 
Which we can discriminate between laws 
Which fall under the Principle's sover- 
eignty and those which do not. Luce 
(1959b) has tentatively identified such a 
criterion: “All physical examples which 
have been Suggested to me as counter- 
examples . . , have a common form: 
the independent variable is a ratio scale, 
but it enters th 
less fashion” (pp. 90 ff.). Thatis, the law 
can be put into a form in which the 
independent 
times a Parameter, c, whose numerical 
value is determined by the choice of 
Scale for x, so that any change in unit 
for x may be absorbed into ¢ without 
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disturbing the rest of the law—i.e., the 
law can be written in form y = (x) 
= yV(cx), which under transformation 
x’ = kx, goes over into y = óe(x)), 
where c' = c/h. Luce suggests that we 
can avoid violation of Condition 2 in 
such cases by taking the “dimensionless” 
variable © (=4ef ex) as our independent 
variable; then the law becomes y = y(v), 
in which v has no admissible transforma- 
tions and there are hence no restrictions 
imposed on v by Condition 2. Thus a 
more subtle version of Luce's Principle 
emerges: What is now postulated is not 
that Condition 2 is inviolable, but that 
the laws of nature are always such as to 
be capable of an expression which 
satisfies Equation 2. 

Clearly crucial to this revised formula- 
tion of Luce's Principle, however, is the 
concept of a dimensionless variable. 
For if the method for satisfying Equation 
2 in prima facie embarrassing cases is to 
search out a dimensionless expression of 
the independent variables, the restrictive 
force (if any) of the Principle cannot be 
comprehended until we understand what 
is involved in such a construction. So 
pregnant a topic as this amply merits 
discussion and controversy, which I will 
do my bit to provoke by injecting some 
irritants which, hopefully, may stimulate 
Luce and perhaps others to further 
production of methodological pearls. 

It is my considered opinion that to 
render a variable dimensionless in the 
fashion brought to our attention by Luce 
amounts to nothing but a more or less 
arbitrary selection of one of the admis- 
sible scalings of that variable, and then 
working up an "absolute" interpretation 
for that scale. I contend that such an 
interpretation can be found for at least 
one admissible transformation of any 
variable, no matter what its scale type 
(in fact, I would argue that every scale 
can have an absolute interpretation 
forced upon it), and hence that there is 
no law on whose mathematical form the 
revised version of Luce's Principle could 
impose restrictions. To defend this 
thesis in full generality 1s impractical 
here, so I shall confine my remarks to 
ratio scales. 


545 


An example cited by Luce of a law 
which flouts Condition 2, but which can 
be brought to heel by dedimensionalizing 
it, is the law of radioactive decay. If q 
is the quantity, say in grams, of a certain 
radioactive substance S at any time /, 
say the number of seconds after the 
beginning of the Twentieth century, then 


[3] 


where a and b are empirical constants. 
So formulated, the dependent variable 
(q) is a ratio scale while the independent 
variable (/) is an interval scale, which 
apparently contravenes not only Condi- 
tion 2 (cf. Luce, 1959b, Theorem 4), but 
also Luce's suggestion that the only 
violations of Equation 2 are ones in 
which the independent variables are 
ratio scales. However, it is debatable 
whether generalizations such as Equation 
3, in which a time coordinate assumes the 
status of an independent variable, should 
be regarded as genuine laws (cf. Roze- 
boom, 1961, pp. 356f.). If t is an 
arbitrary point in time on the ? scale, 
and qo is the quantity (on the q scale) 
of S at time fo, then Equation 3 may be 
rewritten as 


q = ae 


q = qoe tt» 


or letting d =aef t — to, 
[4] 


Equation 4, in which all the variables 
are ratio scales, relates the quantity (in 
grams) of substance S at the end of a 
given time interval jointly to the in- 
terval’s duration (in seconds) and S's 
quantity (in grams) at the interval’s 
beginning. We can also express Equa- 
tion 4 asa law with only one independent 
variable by dividing both sides by go and 
setting p —aer g/go. We then have 


q = qe 


[5] 


which shows what proportion of S at a 
given moment still remains d seconds 
later, and in which the dependent vari- 
able, 5, is an absolute, or dimensionless, 
scale. 

Neither Equation 4 nor Equation 5 


pe, 
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isfy Condition 2, but if we now 
vid hosf ae a "half-life," Lr, substance 
S has—i.e., Ln is the value of d for which 
p=.5 in Equation 5—we find that 
La = (In 2)/b, and hence that 


g = (is [6] 


Then letting da be the ratio of d to Li— 
that is, if the time duration variable is 
rescaled with the half-life of Sas the unit 
of measurement—we finally obtain 


gq = G)” [7] 


in which the independent variable is an 
absolute scale with no admissible trans- 
formations, and which is hence immune 
to any threat from Condition 2. This, 
then, is the sort of procedure which, 
suggests Luce, may be used to placate 
Condition 2 when an empirical law 
involving ratio scales (e.g., Equation 4) 
appears to disregard it. 

It should be appreciated, however, 
that to describe a variable such as d; in 
Equation 7 as dimensionless is somewhat 
misleading. The scale is just as much 
dependent upon a unit of measurement 
as it ever was; only now one particular 
unit has been singled out as a preferred 
unit, which determines an absolute scale 
of the variable in the sense (and merely 
in this sense) that there is one and only 
one scale which expresses each value of 
the variable as a multiple of this preferred 
unit. This can obviously be done when- 
ever a preferred unit of measurement has 
been selected, and hence any ratio scale 
law, no matter what its mathematical 
structure, can in this fashion be put into 
a form which is compatible with Equa- 
tion 2. 

There is still one more gambit by 
which some restrictive force might con- 

ceivably be salvaged for Luce's Principle: 


3 Equation 4 violates Luce's Theorem 1 
(1959b), while Equation 5 is especially in- 
i in that since its inverse exists and 
ager dent variable has no admissible 
dn d (other than the Identity 
pies aiii d Condition 2 requires that the 
ien He true of its independent variable. 
5 


If a distinction could be drawn between 
natural units of measurement on one 
hand, and merely conventional ones on 
the other, the definition of dimensionless 
scales could be limited to variables for 
which a natural unit is available. While 
it is my belief that “naturalness” in this 
sense is purely a psychological effect 
with little or no methodological signifi- 
cance, I will not argue the point here. 
Instead, I merely invite anyone who feels 
otherwise to state what unit is the 
"natural" one for dedimensionalizing the 
time-duration variable in the law of 
radioactive decay. There is nothing 
special about the half-life interval. For 
any proportion 7, we could just as well 
take substance S's r-life, L,, for our 
standard (i.e., L, is the time required 
for S to decay to the proportion r of its 
initial amount), and write our law as 


prit [6] 
or 


bcr [7] 


where d, is the L,-scale of duration. 
Preference for r = .5 is wholly a matter 
of psychological convenience. 


SUMMARY 


The strong version of Luce's Principle, 
namely, that all laws must conform to 
Condition 2, is not only unsupported by 
reason, but is refuted by counter- 
examples. On the other hand, a modified 
version of the Principle which demands 
of a law only that it be equivalent to one 
which satisfies Equation 2 (or, somewhat 
less generally, that a law is exempt from 
Equation 2 only if its independent vari- 
ables can be dedimensionalized) is vacu- 
ous insomuch as any law, no matter how 
seriously at odds with Equation 2 in its 
original form, can be coaxed into com- 
pliance with the weakened requirement. 
Until further clarifications and supports 
for Luce's Principle are forthcoming: 
then, it would be questionable strategy 
to limit speculations about possible laws 
to mathematical 


A forms approved by 
Equation 2. 


Jó c 
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COMMENTS ON ROZEBOOM'S CRITICISMS OF “ON THE 
POSSIBLE PSYCHOPHYSICAL LAWS”! 


R. DUNCAN LUCE 


University of Pennsylvania 


The publication of Rozeboom's (1962) 
criticisms of my paper concerning limita- 
tions on scientific laws of certain types 
(Luce, 1959), together with what we both 
feel are misinterpretations and mislead- 
ing uses of those results, prompt me to 
attempt to clarify some of the issues. 
At the outset, let me admit that much 
of the confusion is my fault, due in part 
to an inadequate understanding of the 
problem and in part to a not wholly 
satisfactory presentation of the ideas 
and results. These failures have been 
aggravated by some readers who seem 
to have ignored the second half of that 
paper and, thereby, have oversimplified 
its content. 

Rozeboom takes me to task for an 
ambiguous statement of the two criteria 
that were suggested as a possible “prin- 
ciple of theory construction"—despite 
these ambiguities, which he has discussed 
in detail, we both agree about the 
mathematical “nugget” to be extracted 
from them, namely his Equation 2. We 
both emphasize that there are physical 
laws that either fail to satisfy this equa- 
tion or satisfy it vacuously, and he thus 
concludes, correctly I now believe, that 
it should never have been termed a 
"principle." We also both point out 
that whether Equation 2 has any bite in 
limiting the form of a law has something 
to do with the existence of dimensional 
parameters in the law. We seem to 
differ mainly on the emphasis to place 
upon this distinction. Because I believe 
that it is extremely important, I should 
like to make a few remarks about it. 

Among the various lawful statements 
that can be found in any science, Roze- 
boom and I are restricting our attention 
to what may be called function laws— 


i i t by 
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the assertion of a functional relation 
among several variables, each of which 
can be represented as a simple numerical 
scale with certain invariance properties 
dictated by its measurement theory. As 
a specific example, from which we will 
generalize, consider the familiar law 
E = mc, where E denotes energy, ?? 
denotes mass, and ¢ the velocity of light 
in a vacuum, all measured in some con- 
sistent set of units. It will be convenient 
to think of this law in the form Æ — mc 
= 0. The first thing involved in this 
physical law is the assertion that there 
exists a particular function of three var- 
iables, call it y, such that all physically 
realizable values of E,m, and c satisfy 


W(E, m, c) = 0 


But there is a good deal more intended 
than just this simple mathematical 
equation. For example, not all of the 
arguments in the equation are of the same 
type: they are constrained in different 
ways. We usually speak of E as the 
dependent variable, m as the independent 
variable, and c as a dimensional pa- 
rameter. By these words we mean such 
things as this: that certain of the 
symbols can assume arbitrary (positive) 
values and that the law then constrains 
others to have particular values, that 
other symbols represent constants that 
cannot be experimentally manipulated, 
and that certain measurement trans- 
formations of some of the symbols can be 
carried out provided that the trans- 
formations are admissible according to 
the measurement theories for these 
variables and that certain variables and 
parameters are transformed in prescribed 
ways. In other words, the statement of 2 
law involves not only the usually written 
equation, but also certain specifications 
about which values can be arbitrarily 
selected, about which variables can be 
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freely transformed, and about which 
values and transformations are pre- 
scribed when these choices are made. 
To generalize from the example, we 
distinguish three kinds of symbols that 
enter into our functional relation (the 
distinctions among them are rendered 
more precise later): There are m+ 1 
variables, one of which is designated as 


1. the dependent variable, y, and it 
assumes values in a set Y of real numbers, 
and the other m > 1 are designated as 

2. independent variables, x;, and they 
assume values in, respectively, sets Y; of 
real numbers; in addition, there are 

3. n > 0 dimensional parameters, aj. 
Associated with each of these quantities 
is a group of admissible transformations. 
In the case of a variable, the group is 
determined by its measurement theory 
whereas, in the case of a parameter, it is 
simply included as part of the statement 
of the law. We denote these groups by 
Y, X; and A; The more common 
transformation groups found in science 
include the affine group (multiplication 
by a positive constant), the full linear 
group, and the group of strictly mono- 
tonic increasing transformations—corre- 
sponding, respectively, to what are 
called ratio, interval, and ordinal scales. 

Once the variables, parameters, and 
transformation groups are listed, the 
statement of a law then involves two 
distinct assertions. First, a particular 
function V of 1+ m+ n variables is 
given which has the property that for 
any physically realizable choice of x;eX;, 
i—1,2,::-,m,any physically realizable 
value y(xi, +++, v,)eY of the dependent 
variable satisfies 
y Drs, -- sa] 

=0 


5,86) vi May, * <2 les gu 8 


Second, for any set of (admissible) 
transformations T;eX;, i = 1, 2, +++, m, 
of the independent variables, there 
exists an (admissible) transformation 
UT, ..., T,)eY of the dependent vari- 
able and a set of (admissible) trans- 
formations S;(T,, +++. Tn)eAs, j = 1,2, 
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‘n, of the dimensional parameters 
such that 


VO xn ct xu 0 7,2) = 0 
implies 
vEUGOn, -- i Fads, Tix is, TaXn; 
SUT, 9, Tray, ==, 
Sni(Ti, +++, Tata] = 0. 


The first part of such a law, the func- 
tion relating the variables, is always 
stated explicitly, but the second part is 
often implicit. It is simply a convention 
that scientists know and take for 
granted. For example, if a decay law for 
à particular radioactive material is 


q = qx? 


when the time ¢ since a certain event is 
measured in seconds, then everyone 
knows that if we measure time in hours 
we must change the time constant from 
0.14 to (0.14) (360) = 50.4. 

Despite the fact that they are often 
left implicit, these transformation prop- 
erties are important and must be made 
explicit in a full statement of a law. 
Indeed, one of the things that dis- 
tinguishes theoretical science from a 
collection of simple numerical assertions 
are the relations among admissible 
transformations. In addition, the rela- 
tions among transformations make the 
distinctions among the arguments of a 
law more than just linguistic conventions. 
Note that the independent variables are 
the arguments of V whose values in the 
first part and whose transformations in 
the second are selected arbitrarily; the 
value and transformation of the de- 
pendent variable are both determined 
via the law by those of the independent 
variable; and the value of a dimensional 
parameter is independent of the values of 
the variables and its transformation is 
determined by those applied to the 
independent variables. It is often pos- 
sible to restate the law so that a different 
variable is treated as dependent; but 
this is not always possible, as for ex- 
ample, when the dependent variable is a 
periodic function of the independent 


ones. 


550 


Now consider a law (such as F — ma) 
that includes no dimensional parameters, 
and suppose that the solution y of 


yo, xc 


is unique, i.e., there is a function $ of m 
variables such that 


y= olr 


solves the above equation. Then the 
second part of the law can be rewritten 


+, tm) = 0 


» Xm) 


as: for any TyeXi, i = 1, 2, +++, m, then 
there exists U(Ti, +++, T4)eY such that 
if 
y = (m1, +++, Xm) 
then 
U(T3, e To)y = Oo (Pits, +++, Tox) 


Substituting the first of these equations 
into the second, we obtain the following 
restriction on the function 


THD: 7. Fel Gay 7*2) 
= (Titi, my Tas) 


which is Rozeboom's Equation 2 for m 
independent variables; it includes the 
several special cases investigated in Luce 
(1959). For ratio and interval scales 
and m = 1, it was shown that, aside 
from numerical constants, there are very 
few possible solutions $ to this functional 
equation. 

When dimensional parameters are ad- 
mitted, the resulting equation is very 
much less restrictive. Specifically, with 
one independent variable and one dimen- 
sional parameter, the same argument 
leads to 


U(T)b(x, a) = à Tx, S(T)a] 


which has many possible solutions. For 
example, suppose that both y = $(x, a) 
and x are variables measured on ratio 
scales, and suppose that f is an arbitrary 
function of one variable. A solution to 
this equation is (x, a) = f(xa) provided 
that when x is transformed by Tx — kx, 
then U is the identity transformation 
and S = T~, i.e., Uy = y and Sa = a/k. 

Two conclusions seem justified. First, 
Rozeboom is correct in saying that the 
equation that results when there are no 
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dimensional parameters is much too 
restrictive to be called a principle of 
theory construction. Second, and this 
I feel to be the main thrust of both my 
1959 paper and this note, psychologists 
(as well as other scientists) either are 
restricted to a very few possible types of 
laws (in the case of one independent 
variable, to those given in my 1959 
paper when the variables are measured 
on either ratio or interval scales) or they 
cannot avoid including dimensional pa- 
rameters in the statement of their laws. 
Not only must these parameters be 
written explicitly, but a careful prescrip- 
tion should be included about how they 
transform as a function of admissible 
transformations of the independent vari- 
ables, for otherwise the statement of the 
law is incomplete. Following the prac- 
tice of physicists, the transformational 
aspects of laws are often not made 
explicit. This does not seem to create 
serious problems when the transforma- 
tions are limited to the affine group 
(ratio scales), but matters are much less 
obvious when interval and weaker scales 
areinvolved. In these cases, which often 
seem to arise in psychology, it is essential 
that careful attention be given to the 
transformation properties of the vari- 
ables and parameters. 

Finally, the problem remains whether 
or not the number of dimensional pa- 
rameters involved in the statement of a 
law is of any inherent importance. Some 
physicists seem to feel that such pa- 
rameters are undesirable and that their 
total number in a science should be held 
to a minimum. My earlier results 
strongly suggest that the minimum can- 
not possibly be zero. Although psy- 
chologists have hardly begun to face 
these problems, inevitably they will. At 
that time, it would be useful to have 2 
better understanding of the role of 
dimensional parameters than we have 
now. In physics they have meaning— 
the velocity of light, the density of 2 
substance, the viscosity of a fluid, the 
gravitational constant, etc.—and they 
recur in various combinations in different 
laws. As yet, we have little of this !? 
psychology. 
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SUMMARY 


It is pointed out that certain scientific 
laws assert not only a functional relation 
among variables and dimensional pa- 
rameters, but that they also state what 
transformations on the dependent vari- 
able and the parameters are necessary 
for the same function to be satisfied 
when admissible transformations are 
applied to the independent variables, 
It is shown when there are no dimen- 
sional parameters, these transformation 
conditions are very restrictive but that 
they are not when one or more dimen- 
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sional parameters are present. The con- 
clusion is that psychological laws will in 
general involve dimensional parameters 
and that it is necessary to state carefully 
the transformational aspects of the law. 
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COMMENT 


WILLIAM W. ROZEB 


Wesleyan University 


Iam grateful to the Editor for allowing 
me this opportunity. to express. my 
ure at Luce's gracious atceptance of 

E of his earlier paper, and to 
cd = my full endorsement of his 
et conclusions. Briefly, the point is 
P this: If a law is stated without 
nie TA of one particular' scale for 
each of the variables involved, and if, 
moreover, it is intended that alternative 
scaling of these variables be counte- 
nanced, then it is also necessary to make 
explicit provision in the law's statement 
for the manner in which the contemplated 
transformations will affect it. In par- 
ticular, if it is assumed that the function 


relating the variables is invariant under . 


à 
. 


ey 
on 


% 


2 = 
In the article by Frank Restle entitled “The Selection of Strate- pa 
gies in Cue Learning” (Psychol. Rev., 1962, 69, 329. 
equation at the, top of page 333 should read: 


M N.N; 


"In E 
issue, th 2 | 


formational behavior also require study 


$ ak 
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these transformations—and I think that 


Luce is quite right in his judgment that | 
this assumption is normally made, though | 


I am much less convinced of its methodo- _ 
logical necessity—then it will usually be 
necessary to include scale parameters in 
the law whose meanings and trans- 
and clarification. 


Thanks to Luce, 


n" : » " 
explicit awareness of this need has now 


been entered in the mainstream of psy- | 
chological literature, and a few more 


bolts on psychology’s shaky methodo- . 
logical scaffolding have begun to tighten: 
E 
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